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Abstract. The Einstein Telescope (ET), the European project for a third-generation gravitational- 
wave detector, has a reference configuration based on a triangular shape consisting of three 


nested detectors with 10 km arms, where in each arm there is a ‘xylophone’ configuration 
made of an interferometer tuned toward high frequencies, and an interferometer tuned to- 
ward low frequencies and working at cryogenic temperature. Here, we examine the scientific 
perspectives under possible variations of this reference design. We perform a detailed eval- 
uation of the science case for a single triangular geometry observatory, and we compare it 
with the results obtained for a network of two L-shaped detectors (either parallel or mis- 
aligned) located in Europe, considering different choices of arm-length for both the triangle 
and the 2L geometries. We also study how the science output changes in the absence of the 
low-frequency instrument, both for the triangle and the 2L configurations. We examine a 
broad class of simple ‘metrics’ that quantify the science output, related to compact binary 
coalescences, multi-messenger astronomy and stochastic backgrounds, and we then examine 
the impact of different detector designs on a more specific set of scientific objectives. 
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1 Introduction 


Thanks to the extraordinary discoveries of the LIGO-Virgo Collaboration (LVC) in the last 
few years [1-7], the field of gravitational waves (GWs) is blossoming. After three observ- 
ing runs the current catalog of GW detections contains about 90 binary black hole (BBH) 
coalescences, which in the last run have been detected on a weekly basis, as well as two 
binary neutron stars (BNSs) and two neutron star-black hole (NSBH) binaries [5, 6]. These 
discoveries are already having a significant impact on our understanding of the population 
properties (see, e.g. ref. [7]) and provide first results on fundamental physics and cosmology 
(see, e.g. ref. [8, 9]). The tremendous potential of combining multi-messenger observations 
including gravitational waves has been shown by GW170817, the first gravitational-wave 
observation from the merger of a binary neutron-star system which was detected in all the 


electromagnetic bands from gamma rays (GRB 170817A), X-ray, ultraviolet-optical-near in- 
frared (AT2017gfo), to radio [4]. This event had important implications in many fields of 
astrophysics, from relativistic astrophysics, nuclear physics, nucleosynthesis in the Universe, 
and cosmology. 


However, the current infrastructures have intrinsic limitations, and for more than a 
decade, the European community has been preparing the jump toward a ‘third-generation’ 
(3G) European GW detector. The Einstein Telescope (ET) [10-12] is the European observa- 
tory designed to detect gravitational-wave sources along the cosmic history up to the early 
Universe. The US community is participating in the efforts toward 3G detectors, with the 
Cosmic Explorer (CE) project [13, 14]. Third-generation detectors such as ET and CE will 
provide an improvement in sensitivity by one order of magnitude and a significant enlarge- 
ment of the bandwidth, both toward low and high frequencies, and will have extraordinary 
potential for discoveries in astrophysics, cosmology, and fundamental physics. Building on 
many previous works, the science case for ET has been summarized in [15, 16], while a recent 
comprehensive study of the ET capabilities can be found in [17] and, for multi-messenger 
observations, in [18]. For CE and for a more general discussion of 3G detectors see also 
[19, 20]. 

'The current design of ET is based on several innovative concepts, with respect to the 
(second-generation) LIGO-Virgo detectors. In particular, ET is currently planned to be a 
single observatory, located 200-300 meters underground, in order to significantly reduce seis- 
mic noise; to have a triangular shape, consisting of three nested interferometers, providing 
redundancy, the possibility of resolving the GW polarizations, and a null stream, i.e. a com- 
bination of outputs where the GW signal cancels, that can act as a veto against disturbances; 
and it will feature a ‘xylophone’ configuration, in which each of the three instruments actu- 
ally consists of two interferometers, one tuned toward high frequencies, and therefore using 
high laser power, and one tuned toward low-frequency, working at cryogenic temperatures 
and low laser power. The first ET Conceptual Design Report dates back to 2011, and laid 
down the basic structure mentioned above. It has been updated in 2020, in the context 
of the successful proposal for including ET in the ESFRI Roadmap, the roadmap of large 
scientific European infrastructures.! However, a detailed comparison of different alternatives 
in the design and on their impact on the science output has never been performed. The 
concept of ET was proposed well before the first observations of GWs. Unlike then, all the 
knowledge about GW sources that has accumulated in recent years makes such a study with 
astrophysically motivated GW source populations possible, and mandatory, today. 


'The aim of this work is twofold. First, we want to examine the effect of variations in 
the geometry of the detector, comparing a single triangle to a configuration made of two 
well-separated L-shaped detectors, which we will refer to as the ‘2L’ configuration, while 
maintaining all other innovative concepts of the ET design (underground, cryogenic, and in 
a xylophone configuration). Second, we want to study what would be the impact on the 
science output if some aspects of the design should turn out to be difficult to achieve, at least 
in the first stage of operations (both for a triangle and for a 2L configuration). In particular, 
we will study the situation in which the low-frequency (LF) instrument is not operative, and 
we only have the high-frequency (HF) instrument in each interferometer. This will allow us 
to quantify the impact of the LF instrument on the expected science output. 


‘See http://www .et-gw.eu/index.php/relevant-et-documents for a collection of documents on the ET 
design study and science case. 


This study has been performed in the context of the activities of the Observational 
Science Board (OSB) of ET, which is in charge of developing the Science Cases and the 
analysis tools relevant for ET.? The study is structured as follows. In Section 2 we will 
introduce and motivate the different detector geometries considered and we will give the 
corresponding strain sensitivities. We will then discuss the impact of different choices of 
geometry and design on the science that can be made with ET. The coalescence of compact 
binaries, such as binary black holes (BBHs) and binary neutron stars (BNSs), is a primary 
target of 3G detectors. The crucial properties here are the detection rates, the range and 
distribution in redshift of the detected events, and the accuracy in the reconstruction of 
the source parameters. Properties such as detection rate, range and accuracy in parameter 
estimation have the advantage of being very general and requiring minimal model-dependent 
assumptions (which are reported in the paper). Therefore, they already provide a first 
solid understanding of the relative advantages/disadvantages of different configurations. In 
Section 3 we will then compare, from this point of view, the performance of the different 
ET configurations introduced in Section 2. We will first consider ET observing as single 
detector (whether in its triangle or 2L configuration), discussing BBHs in Section 3.1 and 
BNSs in Section 3.2, and we will then perform in Section 3.3 the same study when ET is 
part of a 3G detector network including one or two Cosmic Explorer detectors. The effect of 
the different choices of ET as single observatory on multi-messenger observations of BNSs is 
studied in Section 4. In particular, Section 4.1 discusses the sky-localization and pre-merger 
alerts, while Sections 4.2 and 4.3 evaluate, respectively, the prospects of detecting short y-ray 
bursts (GRB) and kilonovae associated with BNS mergers. 


Stochastic backgrounds of GWs are another primary target of ET. The effect of different 
configurations on stochastic searches is studied in Section 5. In particular, in Section 5.1 we 
discuss the sensitivity to stochastic backgrounds (as expressed by the power-law integrated 
sensitivity, PLS) of the different configurations, while the angular resolution obtained from a 
multipole decomposition is discussed in Section 5.2. A comparison of the different sensitivities 
with predictions for astrophysical backgrounds is performed in Section 5.3, while the impact 
of correlated noise is discussed in Section 5.4. 


Beside the study of these rather general metrics, it is also important to investigate 
how these differences in detection rates, parameter estimation, or sensitivity to stochastic 
backgrounds, reflect on specific scientific targets of ET around which the ET Science Case 
is structured [15], even if this, unavoidably, may introduce some extra model-dependent 
assumptions. In Section 6 we will then discuss the impact of the different configurations on 
a broad set of specific scientific targets of ET in the domains of astrophysics, fundamental 
physics and cosmology. Finally, in Section. 7 we will discuss the properties of the null stream 
for the triangular configuration and its potential benefits. A summary of the main results 
of this study is presented in Section 8 and, in particular, our conclusions are discussed in 
Section 8.3. More technical material is collected in four appendices; in particular, several 
Tables with useful figures of merits for BBHs and BNSs are collected in App. C. We also 
provide, at the end of the paper, a list of acronyms used in the text. 


?See https: //www.et-gw.eu/index . php/observational-science-board for a public repository of papers 
produced within the ET Observational Science Board. 


2 Detector geometries and sensitivity curves 


'The ET initial design included high laser power at the interferometer input, leading to high 
optical power in the cavity of the order of several MW, optimisation of signal recycling, 
frequency-dependent light squeezing, increase of the beam size, and heavier mirrors compared 
to 2G detectors. These specifications led to the so-called ET-B sensitivity curve [10]. This 
design, however, neglected the fact that high circulating power is difficult to reconcile with 
cryogenic test masses. This has led to a ‘xylophone’ concept, in which the detector in each 
arm is actually composed of two different interferometers, one optimized for low frequencies 
(LF) and one for high frequencies (HF); the LF interferometer has low power (since laser 
power is only needed to beat down the shot noise in the high-frequency range) and cryogenic 
mirrors, while the HF instrument has high power and mirrors at room temperature. This 
has led to the ET-C sensitivity curve and, after further refinement of some noise models, to 
the ET-D sensitivity curve [12]. The ET-B and ET-C sensitivities must be considered as 
obsolete, and should not be used. The ET-D sensitivity has been the basis for all recent ET 
studies. Actually, we will use a further refinement of the E/T-D sensitivity curve, recently 
developed in the context of the ET Instrument Science Board (ISB), see below. 

As far as the geometry is concerned, we will study triangular configurations as well as a 
network of two well-separated L-shaped detectors (that we will denote as the '2L' configura- 
tion) and will we examine different options for the arm length and, for the 2L configuration, 
different relative orientations between their arms. For a single triangular configuration, cur- 
rently the candidate sites are the Sos Enattos site in Sardinia, and the Meuse-Rhine three- 
border region across Belgium, Germany and the Netherlands. When considering a single 
triangle, we will locate it for definiteness in Sardinia, but there will be no significant dif- 
ference in the results considering ET in the Meuse-Rhine region. When considering the 2L 
configuration, we will locate one detector in Sardinia and one in the Meuse-Rhine region. 
However, at the current level of precision, the analysis will be valid, with minor changes, for 
any pair of interferometers located at a comparable distance. 

In the limit of coplanar detectors, the setting with parallel arms maximizes the sensitiv- 
ity to stochastic backgrounds and the range to compact binaries, while it is the least favorable 
in terms of the accuracy on angular localization and reconstruction of the distance of compact 
binary coalescences (or other individual sources). Conversely, the setting with arms at 45? 
is basically blind to stochastic backgrounds, while it maximizes the performances in terms 
of angular localization and distance of compact binaries, see Section 5.1. When taking into 
account the Earth's curvature, the relative orientation between the two detectors is usually 
defined with reference to the great circle that connects the two detectors [21, 22]. Denoting 
by 8 the angle describing the relative orientation of the two detectors, defined with reference 
to this great circle, 8 = 0? corresponds to the case where the arms of the two interferometers 
make the same angle with respect to the great circle, while 6 = 45° is when one of the two 
interferometers is rotated by 45° from the 8 = 0° orientation. For 8 = 45°, the sensitivity to 
stochastic backgrounds is minimized, to the extent that it becomes exactly zero in the limit 
[Ax — 0, where f is the GW frequency and Az the distance between the two detectors. 
In this work, we aim at studying the performance of the 2L configurations close to these 
limiting cases; however, setting exactly 6 = 45° is not a convenient choice because it would 
send practically to zero the sensitivity to stochastic backgrounds; instead, as we will see, 
even a small deviation from 6 = 45° allows us to reach an interesting sensitivity to stochas- 
tic backgrounds, without essentially affecting the performances for angular localization and 


parameter estimation of compact binary coalescences. 

We decide to focus on two examples for the relative orientation of the interferometers 
arms in the 2L configurations, representatives of choices favoring, respectively, either the 
angular localization and distance determination of compact binaries, or the detection of 
stochastic backgrounds, but avoiding the limit in which the sensitivity to compact binaries 
is optimized at the expenses of sending to zero the sensitivity to stochastic backgrounds. We 
thus consider a simpler definition of the angles that uses the north at the position of the 
detector in Sardinia for the arms of both detectors. Denoting by a the relative angle defined 
in this way, and by f the relative angle defined with reference to the great circle connecting 
the detectors in Sardinia and in the Netherlands, we have a ~ 6 + 2.51?.? We will study 
the cases a = 0° and a = 45°, that corresponds to introducing a small offset with respect 
to the ‘perfectly’ parallel (8 = 0°) and maximally misaligned (8 = 45°) configurations. The 
results for compact binary coalescence (CBC) parameter estimation depends minimally on 
the precise value of this offset, as long as this misalignment angle is small. We have indeed 
checked that the results shown in Section 3 are unaffected, for all practical purposes, by this 
small offset; e.g. the differences between the cases œ = 45° and the truly optimal orientation 
a = 47.51° would be invisible on the scale of the plots shown in Section 3, and are below 
the variability due to the specific sample realization of the population. In contrast, the 
sensitivity to stochastic backgrounds strongly depends on it, as we will discuss in App. B. 
In the following, we will refer to these choices as parallel (by which we mean o — 0?) and 
misaligned (a = 45°).* 

In particular, we will consider (with the definition of parallel and at 45° specified above, 
i.e. with respect to the north of the detector in Sardinia): 


1. a triangle with 10 km arms (the current baseline ET geometry) 
2. a triangle with 15 km arms 

3. a 2L with 15 km arms, with parallel arms 

4. a 2L with 15 km arms, with a relative orientation of 45? 

5. a 2L with 20 km arms, with parallel arms 


6. a 2L with 20 km arms, with a relative orientation of 45? 


We will also occasionally compare with the results that can be obtained with the most 
advanced network of 2G detectors (LIGO Hanford, LIGO Livingston, Virgo, KAGRA and 
LIGO India), in order to better appreciate the improvement due to 3G detectors. We use 
the publicly available best sensitivities that are planned to be achieved by LIGO, Virgo and 
KAGRA by the end of the O5 run [23]. We will denote this network LVKI O5. LIGO India is 
expected to join the 2G network late in this decade. Both the Virgo and LIGO collaborations 


3For the Sardinia site we take (latitude 40° 31’, longitude 9? 25’), while for the Meuse-Rhine site we take 
(latitude 50° 43’ 23", longitude 5? 55’ 14"). 

“If a 2L option should be retained, it will be important to perform a more detailed study of the trade- 
offs between accuracy in the localization of compact binaries and sensitivity to stochastic backgrounds, as a 
function of the misalignment angle; such a study, however, cannot be performed abstractly but, especially 
when discussing misalignment at the (1— 2)? level, will have to integrate constraints from the geography and 
the geology of the candidate sites. For instance, underground tunnels have to follow optimized paths avoiding 
as much as possible fractures or water springs. 
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Figure 1: Amplitude spectral densities (ASDs, single-sided), used in the present work for different 
scenarios. In both panels we show the ASD for an interferometer with 10, 15 and 20 km arms. 
The left panel shows the full sensitivity curves (obtained from the HF instrument and the cryogenic 
LF instrument, in a xylophone configuration); the right panel is the sensitivity obtained with the 
HF instrument only. For comparison, in both panels we also show the ASD of the 10 km ET- 
D interferometer (which, by definition, includes the full HFLF sensitivity). Note that the ASD are 
plotted without taking in consideration the angle between the arms and the number of interferometers, 
see the text. 


are currently discussing plans for further upgrades of the detectors to be implemented after 
the O5 run by developing and testing key technologies critical for the 3G detectors [24]. We 
will also show some results for a single L-shaped detector of 20 km (denoted as ‘1L’), that 
we will locate for definiteness in the Meuse-Rhine region. 

It is clear that the configurations considered have different financial costs, so some 
configurations might be obviously better than others. The aim of this work is to provide 
scientific input to a broader cost-benefit analysis.” 

In Fig. 1 we show the different noise Amplitude Spectral Densities (ASDs) used in this 
work to characterise the various detector configurations considered. In particular, in the 
left panel we show, for various arm lengths, the noise curve attainable in the full design, in 
which there is a xylophone configuration with an instrument tuned toward high-frequencies 
(HF) and one tuned toward low frequencies (LF), and the LF instrument works at cryogenic 
temperatures. In the right panel we show the ASDs resulting from exploiting only the 
HF instruments. These curves have been computed in the context of the activities of the 
ET Instrument Science Board (ISB).° In particular, the line corresponding to the 10 km 


? An analysis of the costs of the different configurations is well beyond the scope of this study. We note, 
however, that in the comparison between geometries, the total linear arm length is not the only relevant 
parameter for estimating the relative costs. For instance, another significant parameter is the diameter of 
the tunnels. In an L-shaped configuration there will be only one detector per tunnel, compared to two in 
the triangle configuration with nested interferometers. As a result, in an L-shaped detector the diameter of 
the tunnels will be smaller, approximately 6.5 m, to be compared with 8 m for the triangle. Already just for 
what concerns the relative excavation costs, several other factors must be included for a realistic comparison, 
such as the fact that, in the triangle configuration, extra tunnel length is needed to displace the end towers 
of a detector with respect to the input towers of the other, as well as differences in the number and size of 
auxiliary caverns, and in the tunnels serving them. All these aspects can only be evaluated with a detailed 
dedicated study. 

S'These curves are available at https://apps.et-gw.eu/tds/ql/?c=16492, see in particular the Annex 
files. They obtained with the help of PyGWINC package [25]. 


length in the panel labelled “HFLF cryogenic” updates the sensitivity curve known as ET-D, 
available at https://apps.et-gw.eu/tds/?content-3&r-14065, and shown as a reference 
as the black line in both panels of Fig. 1, and reflects more detailed data on the technology 
used in the ET-D design. It should be stressed that, in this phase where the design of ET 
is being more deeply analyzed, the sensitivity curves necessarily evolve, and will continue to 
evolve. The two curves shown, for the full sensitivity and for the HF instrument only, must 
be taken as representative curves, within other possibilities currently under study. Further 
intermediate sensitivities could also be studied (for instance, corresponding to a possible 
intermediate stage where the LF instrument is present, but operates at room temperature), 
and would give results in between those obtained with the HF-only and the HFLF-cryo 
sensitivities. 

When comparing a triangle to an L-shaped interferometer with the same instrument- 
noise ASD, one must take into account that the triangle is made of three nested components 
(a component being an LF and HF interferometer pair), with an opening angle of 60°. 
For the triangle configuration, one must then project the GW tensor of the incoming wave 
onto each of the three components [see e.g. eqs. (9)-(11) of [26] for explicit expressions], 
and then combine the results at the level of the signal-to-noise ratio (SNR) and parameter 
estimation to obtain the ET capabilities. The accurate evaluations of detection horizons and 
parameter-estimation capabilities require the above calculation of GW-tensor projections 
onto the three ET components, as is done for all the results of the present paper, and in 
all Fisher matrix codes mentioned below. However, to have a simple and approximate rule- 
of-thumb for comparing the SNR of a triangle to that obtained, for the same signal, by an 
L-shaped detector with the same ASD, one must first of all take into account that, for each 
given component, the opening angle gives an extra factor sin(60°) = 3/2, see again eqs. (9)- 
(10) of [26]. In a first approximation, one can also assume that the three components of the 
triangle see the signal with the same SNR (which, however, is only approximately true). Then, 
since the SNR of the three components of the triangle add up in quadrature, the SNR? in a 
triangle is approximately larger than the SNR? in a L-shaped detector with the same ASD, 
by a factor 3 x (V3/2)? = 9/4. For this reason, when comparing graphically the ASD of a 
triangle to that of an L-shaped detector, a more fair comparison is obtained dividing the ASD 
of the triangle by a factor 3/2. Since some source of noise, such as Newtonian noise, are such 
that the SNR scales as the length of the arms, in a first approximation one could further be 
tempted to estimate that, for a triangle, SNR œ L V/3sin(60°) = 1.5L, while for a 2L network 
with the same arm-length SNR œ L /2sin(90?) ~ 1.41L.’ However, one must be careful not 
to infer from this that a 2L and a triangle configurations with similar arm lengths necessarily 
perform comparably. The scientific performance is determined basically by the quality of 
parameter estimation, which does not depend just on the SNR. As an obvious counter- 
example, for an ensemble of compact binary coalescence signals, and fixed ASD, 2L parallel 
have a higher SNR than 2L at 45?; however, as far as parameter estimation is concerned, 
2L at 45? perform significantly better (as we will see explicitly for ET in Section 3), exactly 


"In fact, not all noise relevant for 3G detectors are such that the SNR scales linearly with L. Examples of 
noise whose SNR do not scale with L include coating thermal noise (since it also depends on beam size, which 
increases with L), shot noise (since one needs to adjust the arm-cavity finesse depending on L), controls noise 
(since L can change stability conditions of the opto-mechanical system, which requires adjustment of control 
filters) and suspension thermal noise (since one needs larger mirrors in longer detectors, which means higher 
suspended mass), see [27] for discussions. Therefore, the correct comparison is between detectors at fixed 
ASD, as discussed above, rather than at fixed L, and the scaling of SNR « L is a further approximation, to 
be used only for a first orientation. 
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Figure 2: Detection horizons for monochromatic populations of equal-mass non-spinning binaries, 
for the various detector configurations considered. For comparison, we also show in each panel the 
curve obtained using a triangular detector with the 10 km ET-D ASD (which, by definition, includes 


the full HFLF sensitivity). 
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Figure 3: Ratios of the detector horizons for the different geometries. Left panel: full HFLF-cryo 
sensitivities, normalized to the horizon for the HFLF-cryo sensitivity of the 10 km triangle (i.e. to the 
green line in the left panel of Fig. 2). Right: HF sensitivities (still normalized to the full HFLF-cryo 


sensitivity of the 10 km triangle). 


the contrary of what would be inferred using an SNR-only argument. Similarly, for angular 
localization, the baseline between the detectors is another crucial factor. A comparison 
between the performances of different geometries can only be obtained with a parameter- 
estimation study, as we will perform below, and cannot be obtained from considerations 
based only on the SNR. 

In Fig. 2 we show the corresponding detection horizons for equal-mass non-spinning 
binaries, as a function of their source frame total mass, for the various detector configurations 
considered. These are defined as the maximum redshift out to which a binary with the 
chosen characteristics, with optimal location and inclination, can be observed by a given 
detector configuration with a network SNR > 8. This computation is performed using the 
IMRPhenomHM waveform model [28], and accounting for the effect of Earth rotation, which 
can give a contribution in the range of masses typical of BNS systems. For comparison, we 
also show the results obtained for a triangular-shaped detector using the ‘older’ ET-D ASD, 


which reproduces the result in Fig. 1 of [29]. 

In Fig. 3, left panel, we show the ratios of the redshift horizons for three different geome- 
tries (triangle 15 km, 2L-15km-45? and 2L-20km-45°), all in their HFLF-cryo configurations, 
with respect to the 10 km triangle, also taken in its HFLF-cryo configuration. In the right 
panel we show the same ratios for triangle 15 km, triangle 10 km, 2L-15km-45? and 2L-20km- 
45°, now all in their HF-only configuration, again with respect to the horizon of the 10 km 
HFLF-cryo triangle. We can appreciate that, especially for total masses below ~ 10Mo, 
arm-length has a very important effect on the detection horizon: for these masses, the 15 km 
triangle has a range that, in terms of redshift, is larger by a factor 1.5 — 2.2 than that of the 
10 km triangle, and the 2L-15km-45? is larger by a factor 1.4 — 1.8; as we see from the right 
panel, in this range of masses this increase in arm-length would basically compensate the 
loss of the LF instrument (especially for the 15 km triangle). Once again, however, it is im- 
portant to recall that the detection horizon is determined just by the SNR. More important, 
for the science case, is the quality of parameter estimation. For instance, the 2L-15km-45°, 
compared to the 2L-15km-0°, has a smaller horizon reach, but better capacity to determine 
the parameters of the sources, as we will quantify in the next section. 


3  Coalescence of compact binaries 


In this section we study the impact of the different detector geometries and different ASDs 
on the detection and parameter estimation of BBHs and BNSs. Several parameter estimation 
codes, tuned toward 3G detectors, have been developed recently, in particular GWBENCH [19, 
30], GWFISH [31], GWFAST [17, 32], TiDoFM [33, 34] and the code used in [35]. In the context of 
the activities of the Observational Science Board (OSB) of ET, we have performed extended 
cross-checks between these codes, finding very good consistency. The results presented in 
this section have been obtained with GWFAST which, thanks to its vectorization properties, is 
quite convenient for dealing with the large number of runs required, while those presented in 
Section 4 have been obtained with GWFISH, which is also tuned toward multi-band and multi- 
messenger observations [18, 31]. These codes are based on the Fisher matrix formalism. The 
Fisher matrix formalism has well-known limitations (see e.g. [36, 37] for extensive discussion) 
but is currently the only computationally practical way of dealing with parameter estimation 
for large populations. We refer to the original papers [17, 19, 30-32] for technical details on 
the application of the Fisher matrix formalism to parameter estimation for 3G detectors. It 
should also be observed that 3G detectors will observe a large number of events with very 
large SNR. As we will see for instance in Fig. 4 below, ET, in any of its configurations, 
will detect O(10? — 10*) BBH/yr with SNR > 100. These events will largely dominate the 
performance of a detector configuration for most aspects of the science case, and for large 
SNR the Fisher matrix approach becomes more and more reliable. 

We use state-of-the-art waveforms and population models. For the waveforms, we use 
IMRPhenomXPHM for BBHs, which includes precessing spins and higher-order modes [38], 
and IMRPhenomD_NRTidalv2 [39, 40] for BNSs, which includes tidal effects. The parameters 
of the waveform are {Me, m, dL, 9, Q, t, V te, Be, X 1,2) X2,05 X1yys X2 X1,21 X2,z, M, A2}, where 
M, denotes the detector-frame chirp mass, 7 the symmetric mass ratio, dr, the luminosity 
distance to the source, 0 and ¢ are the sky position coordinates (defined as 0 = 7/2 — ô 
and $ = o, with a and 6 right ascension and declination, respectively), 1 the angle between 
the orbital angular momentum of the binary and the line of sight, Y the polarisation angle, 
te the time of coalescence, ©, the phase at coalescence, Xi, the dimensionless spin of the 


object i = {1,2} along the axis c = {x,y,z} of the coordinate system, chosen with the z axis 
pointing along the orbital angular momentum (we use the same conventions as in Section IID 
of [41]), and A; the dimensionless tidal deformability of the object i (which is present only 
for systems containing a NS). Instead of Aj, Ag, we will actually use the two combinations A 
and dA defined in [42], which have the advantage that A is the combination that enters the 
inspiral waveform at 5 PN, while 5A first enters at 6 PN. In particular, 


À- E [a +7 — 3152) (A1 + Ag) + V/A — 49 (1 + 99 — 1172) (41 — 42) . (3.1) 
For BBHs we will perform the inference on all parameters (except, of course, A1, A2, that 
vanish for BHs) and, as in LVK parameter estimations, rather than v and Xi, we will use 
zn (ie. the angle between the total angular momentum and the line of sight; note that 
this is the same as ; only in the absence of precession), the spin magnitudes and angles, 
namely xi, X2, 9s1, 052, ®JL, $1 5. For BNSs, instead, we include tidal deformability 
but, given the small expected values of their spin magnitudes, we only consider the aligned 
spin components in the analysis, thus performing estimation on x1, and xy». The labels 
' and ‘2’ always refer, respectively, to the heaviest and lightest component of the binary 
system. After the inversion of the Fisher matrix, we compute the sky localisation area for 
the events at 90% c.l., AO gos. As in [17, 18], we assume an uncorrelated 85% duty cycle 
in each L-shaped detector, and in each of the three instruments composing the triangle. All 
technical details of the inference process for the results of this section are as described in [17]. 

We generated the catalogs of binary neutron stars (BNSs) with the population synthesis 
code MOBSE |43, 44]. MOBSE evolves isolated binary stars until they possibly become binary 
compact objects. For the models used here, we have evolved 1.8 x 10° binary systems with 
12 different metallicities, ranging from Z = 1074 to 0.02. Primary masses are drawn from 
a Kroupa initial mass function [45] between 5 and 150 Mo, while mass ratios, initial orbital 
periods and eccentricities follow the distributions presented in |46]. MOBSE adopts up-to-date 
models for stellar winds [44]. We model electron-capture, and (pulsational) pair-instability 
supernovae according to [47] and [48], respectively. For core-collapse supernovae, we use the 
rapid model by [49], which enforces a mass gap between neutron stars and black holes. The 
main difference with respect to [49] is that we model neutron star masses according to an 
uniform distribution between Mmin = 1.1 Mo and Mmax = 2.5 Mo, which is a better match 
to current LIGO-Virgo results [7]. We draw natal kicks from a Maxwellian distribution with 
one-dimensional root-mean-square o. = 265 km s7}, rescaled by the mass of the compact 
remnant and by the mass of the ejecta, see [50] for more details. During binary evolution 
we treat mass transfer, tidal evolution, gravitational-wave decay, and common envelope as 
described in [51]. For the common envelope, we assume an efficiency parameter a = 3. 
We describe the evolution with redshift by accounting for the metallicity-dependent star 
formation rate across cosmic time, as described in [52]. The value for the local merger rate 
for this model turns out to be Rọ ~ 250 Gpc^? yr-!, which is consistent with the range 
[10, 1700] Gpc^?yr^! obtained from the LVK results [7]. 

We generated the catalogs of binary black holes (BBHs) by mixing the isolated evolution 
channel (obtained as described above for BNSs) with the dynamical formation channel. We 
used the code FASTCLUSTER [53, 54], which evolves binary black holes in young, globular and 
nuclear star clusters, by accounting for their dynamical pair-up and for hierarchical mergers. 


*'This choice has the advantage of using quantities which are less degenerate among each other, leading to 
better conditioning of the Fisher matrices, thus improving the reliability of this approach. 
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In our catalogs, 55% of all BBHs come from isolated binary evolution, while 28, 15 and 2% 
come from dynamical formation in young, globular, and nuclear star clusters. We chose these 
mixing fractions because they ensure the best agreement with LIGO-Virgo black hole masses 
in the local Universe, see [54] for more details on the methodology. 


3.1 Binary Black Holes 


In this section we present the results obtained for BBHs. The data correspond to one year 
of data taking. With the assumed BBH population, the total number of BBHs coalescing in 
one year is about 1.2 x 10?. We first study the effect of varying the geometry, for detectors 
all with their full HFLF-cryo sensitivity, and we then discuss the effect of varying the ASD. 


3.1.1 Comparison between geometries 


In the upper-left panel of Fig. 4 we show the cumulative distribution of the events with 
a network SNR larger than a given value, while in all other panels, involving parameter 
estimation, we restrict to events that pass a detection threshold SNR > 12, and we show the 
corresponding cumulative distribution of events with an error smaller than a given value.? 
In each panel of this figure we show the results for the six geometries considered, all taken 
with their best ASD, i.e. in the xylophone configuration with the HF instrument and the 
cryogenic LF instrument (we use the label *HFLF-cryo' for this ASD). 

We see from the figure that even if, by itself, the performance of the 10 km triangle 
is extraordinary, the other geometries considered in general provide a further improvement 
(except for angular localization, where the 2L configurations with parallel arms have the worst 
performance). This is particularly evident in the reconstruction of the luminosity distance. 
In our population of 1.2 x 10? events (and in the specific sample drawn from it), the 10 km 
triangle can reach an accuracy on dr, equal or better than 196 for 28 events; this raises to 202 
events for 2L with 15 km arms at 45?, and 365 events for 2L with 20 km arms at 45?, which 


?More precisely, when performing parameter estimation, we restrict to events that, besides passing the 
threshold on the SNR, also have a well-defined inversion of the Fisher matrix. In some cases, the Fisher 
matrix is ill-conditioned, so its inversion can lead to large errors and we then discard the corresponding event, 
see [17] for details. For BBHs, the fraction of ill-conditioned Fisher matrices is of order 196, so has very 
little impact at this level, and is anyhow about the same percentage for different configurations, so this has 
essentially no impact on the comparison between configurations. For BNS, between 4% and 10% of the events 
are discarded, with the higher percentages appearing in the analyses using the ASD including only the HF 
instrument. There is also a slight dependence on the geometry especially in the HF-only case, with the 2L 
configurations resulting in worse conditioned matrices as compared to the triangular ones. However, the events 
discarded in these cases correspond preferentially to the situation when there is only one detector operating 
due to the duty cycle, in which case the Fisher matrix provides a poorer approximation. The ill-conditioned 
Fisher matrices often correspond at the population level to the events that would have anyhow large errors in 
parameter reconstruction (while the science output of the detectors depends mostly on the events with good 
reconstruction), but could also be related to events having a strong degeneracy between some parameters. In 
particular for BNS systems, where a higher percentage of events has ill-conditioned matrices as compared to 
BBH, the solution of discarding events, despite being conservative, might then leave a bias in the observed 
population, especially for specific science cases (as an example, events close to face-on/off are preferentially 
discarded due to the strong degeneracy present in this case between the inclination angle and the luminosity 
distance, but they are relevant e.g. for multi-messenger studies with GRBs). Another solution, adopted for 
the results presented in Section 4, is to regularize the close-to-singular matrices according to the procedure 
described in [31]. This allows us to analyze the non-degenerate parameters of a larger number of the events, 
at the price of introducing a dependence on the adopted regularisation threshold. For a recent discussion and 
comparison of these intrinsic limitations of the Fisher approach related to the inversion procedure see e.g. 
Appendices B and C of [17]. Each method has its own benefits, and it is reassuring that both have been used 
in the present analysis producing consistent results. 
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is the configuration that provides the best results (a set of tables with the explicit number of 
events with SNR above given thresholds, or errors on various parameters below given values, 
is collected in App. C). In general (not surprisingly), the 2L configuration with the longest 
arm length, 20 km, and relative arm orientation of 45°, is the one that performs better for 
all the parameters. 

We can also appreciate from Fig. 4 that the configurations of 2L with parallel arms 
performs quite poorly (comparatively) for angular localization. While this was generally 
expected because of the parallel arms, it is quite interesting to see that, in terms of angular 
resolution, even the 2L with 20 km parallel arms performs somewhat less well than a single 
10 km triangle, despite the longer arm length and the possibility of performing triangulation 
(even if only partial, since just two detectors are involved). Again, the best results are 
obtained with the 2L at 45?. For example, with our choices for the population and in our 
sample, the 10 km triangle can detect 35 BBH/yr with angular resolution below 1deg?, to 
be compared with 92 BBH/yr for the 15 km 2L at 45? and 155 BBH/yr for the 20 km 
2L at 45? (see Table 40 in App. C for a more extended set of examples). For the chirp 
mass reconstruction, the arm length sets the hierarchy among these configurations (see also 
Table 41 in App. C) so, again, among the configurations considered, the 10 km triangle is 
the least accurate. The results for the reconstruction of the symmetric mass ratio and of the 
spins show, instead, less pronounced differences among these configurations (note, however, 
the logarithmic scale, that visually flattens the differences; see again Table 41 in App. C for 
numerical values). 


From Fig. 4 and Tables 40, 41 we also see that the 15 km triangle has performances 
very similar to the 15 km 2L at 45?, except for the accuracy on the luminosity distance, 
where the 15 km 2L at 45? is clearly superior; for instance, the 15 km 2L at 45? would detect 
202 BBH/yr with error on the luminosity distance better than 196, to be compared with 
77 for the 15 km triangle. It is interesting to understand the reason for this. We discuss 
the issue in App. D where, examining the correlations between the various parameters for 
typical well-measured events, we find that, for the triangle, the error on dr, is more sensitive 
to the error on the localization, polarization angle and orbit inclination, so that errors on 
these parameters have a larger effect on the marginalized error on dr, compared to the 2L 
configuration. 


Finally, it is interesting to show the joint distribution of the events with respect to 
Adr /dr, and AQgoy, since this determines the overall localization volume. To avoid a pro- 
liferation of plots, here we limit ourselves to a comparison between the 10 km triangle and 
15 km 2L at 45?, which is shown in the left panel of Fig. 5, and between the 15 km triangle 
and 15 km 2L at 45?, shown in the right panel. We see that the 15 km 2L at 45? has many 
more events than the triangle in the region of the (Adr /dr, AQ90%) plane corresponding to 
an interesting three-dimensional localization, e.g., Adr /dr, € 1% and 


As far as these metrics are concerned, one can conclude that, while the performance of 
the 10 km triangle is by itself remarkable, the 2L configuration with 15 km arms, oriented 
at 45°, provides a further improvement (as the 2L configuration with 20 km arms or the 
15 km triangle). In contrast, the 2L with parallel arms looks quite disfavoured, because of 
a comparatively poor angular localization capability. The 15 km 2L at 45? and the 15 km 
triangle provide results quite comparable, except for the accuracy on luminosity distance, for 
which the 15 km 2L at 45° is clearly superior. 


Finally, in Fig. 6 we study the performance of a single L-shaped detector with 20 km 
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arms in the full HFLF cryo configuration, and we compare it with the 10 km triangle and 
with the 15 km 2L at 45°, also at their full sensitivity, as well as with LVKI O5. The most 
apparent feature is that, in terms of angular resolution and reconstruction of luminosity 
distance, a single L is clearly performing very poorly, and is even way below the LVKI O5 
forecast. "This shows that a single L-shaped detector, not inserted in a global network, is 
basically useless for those aspects of the Science Case, such as multi-messenger astronomy or 
cosmology, that require good estimates of sky-localization and distance of the sources. For the 
other parameters, however, a single L of 20 km, although it provides worse estimates, can be 
comparable to the 3G networks that we have considered.? 


3.1.2 Effects of a change in the ASD 


We next investigate how these results are affected by changes in ASD. In Fig. 7 we show, 
for the 10 km triangle, the cumulative distributions of the SNR and of parameter estimation 
(where again, for the SNR. we show the cumulative distribution of events with SNR larger 
than a given value, while for parameter estimation we restrict to events with SNR > 12 
and we show the cumulative distribution of events with error smaller than a given value) 
for the full ASD including the HF instrument and a cryogenic LF instruments (denoted 
'HFLF cryo’), and for the ASD in which the LF instrument is completely missing and only 
the HF instrument is operative (denoted ‘HF only’). We also show, for comparison, our 
forecasts for LVKI O5. In order to be consistent with the ET results, the analysis for 
LVKI O5 is obtained following exactly the same Fisher-Matrix analysis used for ET. For a 
more precise and realistic forecast, we refer to |23] which uses a Bayesian framework close to 
real analysis applied to the current gravitational-wave observations. 

An important message that emerges from Fig. 7 is that even if, at least in the first 
stage of operations, the LF instrument were not operational, still ET in its 10 km triangle 
configuration and HF-only sensitivity would provide a significant jump from 2G detectors, in 
terms of a number of detections, distribution of SNR and parameter reconstruction of BBHs, 
with the exception of angular localization (as expected, for a single infrastructure that cannot 
rely on triangulation). It is also worth noting that a single observatory, such as the 10-km ET 
triangle, at its best sensitivity, can localize a comparable number of sources with an accuracy 
as good as that of a network comprising five 2G detectors at full O5 sensitivity. 


Note that, on a single event basis, the events observed by LVKI are almost always better 
localized than by ET, showing that localization benefits more from the network triangulation 
than from ET's higher SNR (with fluctuations due to the assumed duty cycle). Nonetheless, 
the number of detections by ET is more than 10 times higher, and comprises high SNR 
events with low masses and/or high mass ratios. The high-SNR low-mass BBH mergers can 
be optimally localized accessing low frequencies and using the imprint of Earth's rotation in 
the longer observed signal. The localization of high-SNR events with large mass ratio bene- 
fits from the detection of higher-order modes. This results in a similar number of optimally 
localized sources in the cumulative distribution up to a sky-localization of about 6 square 
degrees, see Fig. 7. The larger amount of BBHs localized with sky-localization uncertainty 
larger than this threshold by ET compared to LVKI network depends on the detection effi- 
ciency; ET has much higher detection efficiency, reaches higher z and has many more distant 


This, of course, assumes that spurious transient noise and glitches could be vetoed and eliminated reliably 
since, for a single detector, it would not be possible to reduce them by using the coincidence between inde- 
pendent detectors, or (as for the triangle) the null stream. In particular, the detection of unmodeled burst 
signals could not be reliably claimed with a single detector. 
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events where LVKI is no more able to detect sources. Going to larger and larger redshifts, 
on average the SNR decreases, and the ET localization worsen. 


Fig. 8 shows the corresponding results for the 2L configuration with 15 km arms at 45?, 
with the two different ASDs. For comparison, in this figure we also show both the results 
for the 10 km triangle in the full HFLF-cryo configuration, and the results for LVKI O5. 
It is interesting to observe that, even in the configuration in which the LF instrument is 
completely missing, for BBHs the 2L with 15 km arms at 45? is superior to the 10 km 
triangle with its full (HFLF cryo) sensitivity for the accuracy on the luminosity distance 
(particularly for the number of events with very good accuracy, say Adr /dr € 1%, with 56 
events against 28, see Table 40 in App. C) and gives basically equivalent results for angular 
localization, spins and orbit inclination. In contrast, for the chirp mass and symmetric mass 
reconstruction, the full HFLF-cryo triangle is better than the HF-only 2L-15km-45?. In the 
comparison with LV KI O5 we see that, even in the HF-only configuration, the 2L with 15 km 
arms at 45? provides a remarkable jump from 2G detectors on BBH detection number, SNR 
distribution and parameter reconstruction, except that the angular localization of the best- 
localized events would no longer be better than that of the five-detector LVKI network at 
O5 sensitivities (although the number of events with ANg% < 10? deg”, relevant for multi- 
messenger studies, is a factor ~ 5 higher for the 2L with 15 km arms at 45°, compared to 
LVKI O5). See again Tables 40 and 41 in App. C for a compilation of number events for the 
different geometries and ASDs, and different cuts on the parameters. 

A main message that emerges from Fig. 8 is that, as far as BBH parameter reconstruc- 
tion is concerned, even if in a first stage of operations the whole LF instrument should not 
be operative, a 2L configuration with 15 km arms at 45° would still be a very competitive 8G 
instrument, in fact quite comparable to a 10 km triangle at full sensitivity (with better perfor- 
mance of luminosity distance, less good performance on mass reconstruction, and equivalent 
performances on all other parameters and in SNR distribution).'' 


Finally, another important target of 3G detectors are intermediate massive black holes 
(IMBHs) binaries, with masses in the range 10? — 10* Mo, In this case, there are currently 
no observational constraints, and thus we do not use any astrophysically motivated IMBH 
population. Going back to Fig. 2, which showed the horizons for monochromatic non-spinning 
populations of equal-mass binaries (or Fig. 3, that directly gives the relative differences with 
respect to the baseline 10 km triangle), it can be deduced that, with the best sensitivity 
curve (HFLF-cryo configuration), for total masses above about (300 — 400) Mo, the difference 
between the different ET geometries is at about 10% level. For masses around 100Mo the 
difference is larger, and for instance for 100M the detection horizon ranges between z ~ 42 
and z ~ 50 for the different geometries; however, in this redshift range we do not even expect 
astrophysical IMBH. Much more significant is the dependence on ASD. The right panel of 
Fig. 2 show a significant reduction in the IMBH horizon for the HF-only configuration. This 


"Tt should also be observed that, in most studies where the accuracy on both volume localization and 
masses is relevant, as for instance in joint studies of population and cosmological parameters, the bottle-neck 
is in the accuracy on sky localization and luminosity distance, which are always much worse than the relative 
accuracy on the (detector-frame) masses. As we see e.g. from Fig. 8, the relative error on the (detector-frame) 
chirp mass and on the symmetric mass ratio is typically well below 10 ^^ and can be as low as 10? for the 
chirp mass and 1078 for 7, while, for dr, it is difficult to go below the 196 accuracy. Therefore, having a 
better performance on dr has a much stronger impact than having a better performance on the detector- 
frame masses, which in any case are very accurately measured. Note, furthermore, that at the large redshifts 
explored by 3G detectors, the accuracy of the reconstruction of the actual (source-frame) masses from the 
detector-frame masses will be determined by the accuracy of the reconstruction of the redshift. 
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shows the importance of the low-frequency sensitivity for IMBHs. 


3.1.3 Golden events 


Another important metric to assess the performance of a detector, or of a detector network, 
is provided by the number and redshift distribution of ‘golden events’, ie. events with 
especially good properties in terms of SNR (e.g. BBHs with SNR > 100), or error on 
luminosity distance (e.g. BBHs with Adz/dz < 0.05) or angular localization (e.g. BBHs 
localized to better than 10 deg”). Indeed, while for some aspects of the Science Case, such as 
population studies, the completeness of the sample is a key element, for other aspects, such as 
precision tests of General Relativity (GR) or cosmological studies, the result will be largely, 
if not uniquely, determined by these 'golden events' (see the discussion in Section 5.4 of [17] 
and Section VI of [19]). In Fig. 9 we show the redshift distribution of the BBHs detected with 
SNR > 100, the distribution of events with Adz /dr, < 5%, and the distribution of events with 
AQ% X 10 deg?.!? In the upper row we show the results for the six detector geometries 
that we are considering, all taken at full (HFLF cryo) sensitivity. The middle row focuses 
on the 10 km triangle and shows how the result changes with the ASD, while the lower row 
shows the same result for the 2L with 15km arms at 45? (comparing also with the reference 
triangle configuration). In all plots, we show for comparison the redshift distribution of the 
BBH population that we have used and, for each detector configuration, the distribution of 
the events detected with SNR > 8 (which, on this scale, is almost indistinguishable from the 
whole population, at least for z <2). 

From the left panel in the upper row, and Table 39 in App. C, we see that, in terms 
of detections of events with large SNR, the 10 km triangle by itself has remarkable perfor- 
mances, with (in our sample realization) 2298 BBH/yr detected with SNR > 100, and a 
redshift distribution extending up to z ~ 5. However, this is further improved in the other 
configurations; in particular, in the 2L-15km configurations, the number of BBH/yr with 
SNR > 100 is 4933 for the misaligned configuration (and 5143 for the aligned-arms config- 
uration), more than twice as large, and extends up to z ~ 7; this further raises to more 
than 8000 for the 2L-20km configurations. On the other hand, we already saw from Fig. 4 
that, over the whole ensemble of detected events, the setting with parallel arms has compar- 
atively poor performance for angular localization and accuracy on the luminosity distance; 
this remains true also when we restrict to golden events, as we see from the middle and the 
right panel in the first row of Fig. 9. In particular, the 2L configurations with parallel arms 
(whether with 15 or 20 km arm-length) are by far the less performing in terms of events with 
AQ < 10 deg? (followed by the 10 km triangle) while, in terms of the events with errors 
smaller than 5% on dz, the 10 km triangle is, comparatively, the less performing, followed by 
the 2L configurations with parallel arms, while the 2L configurations with arms at 45? give 
the best results. 

These results indicate that, for BBH golden events, the 2L configuration with arms at 
45°, or the 15 km triangle, provide the best compromise between detecting many of them, 
and out to large redshift, and localizing them, further improving on the already remarkable 
performances of the 10 km triangle. 


1?Of course, these distributions are correlated. For instance, events with large SNR typically have small 
errors on dr. The correlation between luminosity distance and angular resolution is shown, for BBHs, in 
Fig. 5, and the analogous result for BNS will be shown in Fig. 11. See Figs. 12 and 17 of [17] and Figs. 5-9 of 
[18] for further scatter plots of the correlations among different parameters, for the ET triangle configuration. 
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The middle row of Fig. 9 shows how the results depend on the ASD, for the 10 km 

triangle. From these plots we infer that, also for golden BBH events, a 10 km triangle would 
provide an extraordinary jump, compared to 2G detectors, even in the HF-only configuration. 
Indeed, by comparison, at LVKI O5, even having assumed the best-planned sensitivities for 
all five detectors, with our BBH population and the specific sample that we have drawn 
from it, we find only 4 events with SNR > 100, to be compared with about 3000 for the 
10 km triangle with the full HFLF-cryo ASD, and almost 800 for the 10 km triangle with 
the HF-only ASD, see Table 40 in App. C. 
The lower row of Fig. 9 shows the dependence on the ASD for the 2L geometry with 15 km 
arms at 45? (showing, for comparison, also the result for the 10 km triangle at full sensitivity). 
Once again, a significant conclusion is that the 2L geometry with 15 km arms at 45°, even 
with the HF-only instrument, is comparable to the 10 km triangle at full sensitivity. 
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Figure 4: Cumulative distributions of the number of detections per year, for the SNRs and for the 
error on the parameters, for BBH signals, for the six considered geometries, all with their best ASD, 
including xylophone configuration and cryogenic LF instrument. 
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Figure 5: The joint accuracy on luminosity distance and angular resolution for BBHs. Left panel: 
the 15 km 2L at 45? (blue) compared to the 10 km triangle (green). Right panel: the 15 km 2L at 
45? (blue) compared to the 15 km triangle (red). 
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Figure 6: Comparison of SNR and parameter estimation error for the 10 km triangle, 15 km 2L 
misaligned and a single 20 km L with the HFLF-cryo ASD. By comparison, we also show the forecast 
for LVKI O5. 
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Figure 7: Comparison of SNR and parameter estimation error for the 10 km triangle with the 
HFLF-cryo and the HF-only ASDs. By comparison, we also show the forecast for LVKI O5. 
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Figure 8: Comparison of SNR and parameter estimation error for the 15 km 2L misaligned with the 
HFLF-cryo and the HF-only ASDs. For comparison, we also show the result for the 10 km triangle 
in the HFLF-cryo configuration, and for LVKI O5. 
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Figure 9: 


a 


Redshift distribution of BBHs detected with SNR > 100 (left column), or with relative 


error on the luminosity distance Adz /dr, < 0.05 (central column), or with sky location AQ < 10 deg? 
(right column) for various detector geometries and sensitivity curves. The upper row shows the results 
for the six considered geometries, all with their best sensitivity, the central row for the 10 km triangle 
with the two considered ASDs, and the bottom row for the 2L with 15 km arms at 45? and the two 
considered ASDs (together with the 10 km triangle with the full ASD). In each panel we also show, 
for reference, the redshift distribution of the BBH population used (black solid line), and of the events 
detected in the various configurations with SNR > 8 (dotted lines). 
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3.2 Binary Neutron Stars 


We now perform the same analysis for BNSs. The corresponding results are shown in Figs. 10- 
15. We discuss again separately the dependence on geometry, on ASD, and the ‘golden 
events’. 


3.2.1 Comparison between geometries 


Fig. 10 confirms the basic message obtained from BBHs in Fig. 4: the 10 km triangle has, in 
itself, remarkable performances and produces order-of-magnitudes improvements with respect 
to 2G detectors. However, a further improvement can be obtained from the other geometries 
that we have considered, on almost all parameters. The only exceptions are the inclination 
angle ų¿, for which the 2L configuration with parallel arms performs, comparatively, quite 
poorly, both for 15 km and 20 km arm-length,!? and the luminosity distance, for which the 
10 km triangle performs similarly to the 2L with 15 km parallel arms. Similarly to what we 
found for BBHs, also for BNSs the 2L 15 km with arms at 45? improves on the 10 km triangle 
on all parameters. Comparing instead the 2L 15 km at 45? to the 15 km triangle, we see that 
the two perform similarly on most parameters except for luminosity distance, for which the 
2L 15 km at 45° is significantly better. The 15 km triangle, however, performs better than 
a 15 km 2L with arms at 45° for the inclination angle v, while the 2L 15 km at 45? performs 
better for the polarization angle v. 

For the parameter A related to the tidal deformability of the two NSs, again the 10 km 
triangle already obtains remarkable performances, with 1040 events/yr with A measured 
better than 1096, and the best events providing a measurement at a few percent level; however, 
all other geometries that we considered improve further on this; in particular, the 2L-15km- 
45? detects 2463 events/yr with A measured better than 1096, so an improvement by a factor 
of 2.5, and the 15 km triangle 2783, see Tables 42, 43 and 44 in App. C. 

Fig. 11 shows the joint accuracy on luminosity distance and angular resolution for 
BNSs, comparing the 15 km 2L at 45? to the 10 km triangle (upper panel) and to the 
15 km triangle (lower panel). Considering, for instance, the events with Adr /dr, < 0.1 and 
A Cos, < 10? deg”, we see that the 


Fig. 12 shows the performance of a single L-shaped detector of 20 km for BNSs, com- 
pared to other geometries, similarly to Fig. 6 for BBH. We observe that, with a single 
L-shaped detector, no BNS will have an angular resolution AQgoy, below 10 deg?, and (in our 
sample realization) only 6 BNS/yr would have AQgoy, < 10? deg? (to be compared with 184 
BNS/yr for the 10 km triangle and 559 BNS/yr for the 2L configuration with 15 km arms 
at 45?, see Table 43 in App. C.). The reconstruction of distances would also be compara- 
tively poor, with just a handful of events measured to better than 1096; so, without other 3G 
detectors, a single L-shaped detector, even with 20 km arms, would miss all the aspects of 
the Science Case, such as multi-messenger astronomy or cosmology, that require good source 
localization. 


3.2.2 Effects of a change in the ASD 


Fig. 13 shows the result for the two ASDs of the 10 km triangle and compares them with 
LVKI O5. Quite interesting results emerge from the panel on the angular resolution. A five- 
detector network, such as LVKI O5, would of course have excellent localization capabilities 


The difference with respect to the angle 0; shown in the BBH case is that, for BBHs, higher-order modes 
and precessing spin significantly help to determine the inclination angle. 
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for some very close events, so that it can reach a localization below 1 deg? on a few events 
per year, which a single 10 km triangle, even with the full ASD, cannot reach. However, this 
will be limited to a handful of particularly close events. What is also relevant, in the panel 
for AQooy, of Fig. 13, is the number of BNS with AQgoy below about 10? deg?. In fact, such 
an angular localization accuracy can already be sufficient to perform multi-messenger studies 
with wide field of view (FoV) electromagnetic observatories, as has been demonstrated by 
the current follow-up of gravitational-wave signals (see e.g. [4, 55]). In this case, the single 
triangle is quite superior to the LVKI O5 network, by detecting a number of well-localized 
sources higher by a factor ~ 3.6 (see Table 43 in App. C for a compilation of numerical 
values relative to these figures, and Section 4 for a detailed discussion and implications for 
multi-messenger studies). However, this is true only for the triangle at full sensitivity, i.e. in 
the HFLF-cryo configuration. The ET full sensitivity localization significantly benefits from 
the use of the effect of Earth's rotation on the longer signal detected, thanks to the access to 
lower frequencies. The benefit is much larger for closer events where the BNS signal can be 
followed for a longer time. In terms of the number of BNSs with angular resolution below 
10? deg?, the HF-only configuration is therefore much worse than LVKI O5. For most other 
parameters, in contrast, a 10 km triangle provides a significant jump with respect to LVKI O5 
even in the HF-only configuration (except for luminosity distance and polarization, where it 
is worse; this is because these quantities are more correlated with angular localization). See 
again Table 43 in App. C for a compilation of numerical values. 

Fig. 14 show the analogous results for the 2L configurations with 15 km arms at 45°; 
for comparison, we also show both LVKI O5 and the 10 km triangle at full sensitivity. First 
of all, we see again that, at the level of full sensitivities, the configurations with 15 km arms 
at 45? is superior to the 10 km triangle for all parameters. We also see that, for angular 
localization, the HF-only configuration is much worse than LVKI O5, at least for events with 
AQ% < 10? deg?, which are the relevant ones for multi-messenger studies, while, for all 
other parameters, even the HF-only configurations provides a remarkable jump compared to 
LVKI O5. However, contrary to the BBH case, the HF-only configuration is now very much 
inferior to the 10 km triangle with the full ASD, particularly for sky localization. This is due 
to the fact that the improvement of the low-frequency sensitivity allows a BNS to remain in 
the detector bandwidth for hours and even up to a day, as we already mentioned above (see 
also Fig. 2 of [17]), and then the rotation of the Earth improves the BNS localization. This 
is less relevant for BBHs since most of them have a total mass greater than about O(20 Mo), 
and stay in the detector bandwidth for a time too short to appreciate the effect of Earth's 
rotation. Therefore, the low-frequency sensitivity, and therefore the LF instrument, is more 
important for the localization of BNSs compared to BBHs. This shows the crucial importance 
of the LF instrument for BNSs. 


3.2.3 Golden events 


Fig. 15 shows the redshift distribution of ‘golden events’ for BNSs, defined here either by the 
condition SNR > 30, or by Adz /dr, € 0.2, or by ANg% € 10? deg?. These conditions are less 
stringent than those used for BBHs in Fig. 9, reflecting the fact that BNSs are intrinsically 
less loud than BBHs. The upper row compares the different geometries, all taken with their 
best ASD, and basically confirms the picture obtained for BBH golden events, compare with 
the upper row of Fig. 9: for the SNR, the 10 km triangle gives the less good results, while the 
other configurations are comparable, with the 2L with 20 km parallel arms being the best; 
for luminosity distance and angular localization, the 2L configurations with arms at 45? give 
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the best results and, overall, appears to provide the best compromise between having a large 
number of events with large SNR out to large distances, and localizing them well in volume. 

The middle row quantifies the loss of performance of the 10 km triangle when the 
LF instrument is absent. The loss is particularly dramatic for the accuracy on luminosity 
distance and sky localization. This confirms the crucial importance of the LF instrument for 
BNS, as we already discussed when commenting Fig. 14. 

From the bottom row we see that, also on this metric, the 2L configurations with arms 
at 45? and the HF-only ASD gives results basically identical to the 10 km triangle with full 
HFLF-cryo sensitivity for the SNR distribution; in contrast, we already remarked on the full 
ensemble of events that the LF instrument is crucial for angular localization and accuracy 
on luminosity distance for BNSs, and we see the same information emerging from golden 
events, where the HF-only configuration of the 2L configurations with arms at 45? is very 
much inferior to the full 10 km triangle, in contrast to what happens for BBHs, compare 
with Fig. 9. As already discussed, this is due to the fact that, with a good low-frequency 
sensitivity, BNSs can stay in the detector bandwidth for hours to about 1 day, with the 
corresponding benefit in angular localization and in all parameters correlated with it. 


3.2.4 Dependence on the population model 


Finally, it is interesting to discuss how these results depend on the model that we have used 
for the BNS population (and on the specific random sample extracted from it). First of all, we 
remark that the differences between designs that emerge from these plots, and the conclusions 
that we draw from them, are not driven by the tails of the distributions (that, even for a given 
population model, would also depend on the sample realization), but rather by their overall 
behavior, which in general involves hundreds of events; even for the ‘golden events’, there are 
several tens to several hundreds events per bin, see Fig. 15. To understand in full generality 
how the results depend on the population model used is quite difficult; the full set of runs, 
on all different detector configurations studied, is computationally expensive, and it is not 
feasible to explore a large parameter space of population models in a reasonable timeframe. 
Our strategy has rather been to use a state-of-the art population model [52] for BNS and 
NSBH (and [54] for BBHs), which are able to match the LVK rates. However, a useful 
check, for BNS, can be performed as follows. For the redshift distribution of the BNS rate, 
current models are quite sophisticated and we do not expect great variations (furthermore, 
we are not much interested in the absolute numbers of detections in a given configuration, but 
rather in the relative performances of the detectors). There is large uncertainty in the overall 
normalization, given by the local rate, which for BNS can be up a factor 10 higher or lower 
than the one that we use. However, while this will influence significantly the absolute number 
of detections per year, again it will not affect much the comparison between detectors, which 
is our main aim. For instance, if the local rate should turn out to be a factor of 2 smaller 
than the one that we use, the plots that we show will have to be understood as relative to 
2 years of data, rather than 1 yr; however, the conclusions for the relative performances of 
the different design would be basically unaffected (of course, the tails of rare events will be 
affected for a given fixed observation time but, again, the conclusions are not really dependent 
on these tails). So, for our purposes the parameter that is most significant to vary, for BNS, 
is the mass distribution. As mentioned on page 10, the results for BNS presented above 
were obtained with a flat mass distribution between 1.1 and 2.5 solar masses. We have 
repeated the analysis using a Gaussian mass distribution, with mean 1.33M and standard 
deviation 0.09 M5, sampled independently for both masses (we then impose that the heaviest 
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one corresponds to m1). The results, for the six considered geometries in their full HFLF- 
cryo configuration, are shown in Fig. 16. Comparing with Fig. 10, we see that the results are 
very consistent. There is a somewhat smaller number of overall detections (because in the 
gaussian distribution events with large masses are suppressed), but the parameter estimation 
is quite similar (with differences in the tails, expected because of sample variance), and the 
conclusions for the comparison between detector configurations are unchanged. 
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Figure 10: Cumulative distributions of the number of detections per year, for the SNRs and for the 
error on the parameters, for BNS signals, for the six considered geometries, all with their best ASD, 


including xylophone configuration and cryogenic LF instrument. 
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Figure 11: The joint accuracy on luminosity distance and angular resolution for BNSs. Left panel: 
the 15 km 2L at 45° (blue) compared to the 10 km triangle (green). Right panel: the 15 km 2L at 
45° (blue) compared to the 15 km triangle (red). 


— 98 — 


BNS 


2L 45? 15 km HFLF cryo 


— A 10 km HFLF cryo LVKI O5 


1L 20km HFLF cryo 


Cumulative number of events 


Cumulative number of events 
ce © [e] ise] © 
> 2 Q% & 9 


10-6 1075 1074 107? 107? 


Ke wn 
E = 
g 054 g 054 
o o 
ua uw 
° 1014 © 101] 
w we 
Oo o 
à à 
E 0? 4 =I 03 4 
o 2 J o 24 
2 > 10 
E E 
d 04 d 014 
E E 
3 3 
QO 10? j O 104 
107! 10° 10! 10? 10? 104 107? 107? 107! 10° 10! 


AN0% (deg?) Ad; /di 


Cumulative number of events 


Cumulative number of events 
2 ei em E e 


10-4 107 107? 1071 10° 
At (rad) 

j= B 
o 5 o 5 

0? 4 05d 
5 5 
GH GH 
o 044 o 044 
PE P 
Oo Oo 
à à 
& 104 E 1053 
3 3 
E a 

2 J 2J 
E E 
Z 014 Z 01d 
E E 
3 3 
O 1094 O 10? 4 

107? 107? 10-1 10° 107? 107? 10-1 109 

Ay (rad) AA/A 


Figure 12: Comparison of SNR and parameter estimation error for a single 20 km L-shaped detector 
with the HFLF-cryo ASD, compared to the 10 km triangle, a 2L with 15 km arms at 45°, and the 
forecast for LVKI O5. 
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Figure 13: Comparison of SNR and parameter estimation error for the 10 km triangle with different 
ASDs. By comparison, we also show the forecast for LVKI O5. 


— 30 — 


— A 10 km HFLF cryo 
— — 2L 45° 15 km HFLF cryo 


BNS 


—— — 2L 45° 15km HF only —— LVKI O5 


2 r a 
= 1 — 
= ] =| 
oO 0? 4 1 [5] 04 
5 5 
GH uw 
© 104] © 104] 
E z 
E 10] "E 1054 
Z E 
€ 10?4 L 10? 3 
B B 
EE zs 
E 0^3 E 0! d 
E 0 5 0 
O 10 i ——— — 03 — T T M 
107! 109 10! 10? 10? 107 10-6 10-5 1074 10-3 107? 
SNR AM.JM. 
n n 
E E 
Oo 5 o 5 
054 054 
5 5 
GH GH 
o 044 o 014 
z z 
Es a 
E 03d g 03 4 
Z E 
) 2 J 2 J 
8 0^3 E 0^3 
3 = 
O 10°! O 1004 
107! 10° 10! 10? 10? 10* 10-3 107? 107! 109 10! 
2 
ANu% (deg?) Adr./dr, 
u ul 
E E 
o o 5 
= 0 ^ 
5 Em 
c uw 
o o 044 
=I =I 0? 4 
3 3 
= a 
2J 
E a 
+ — 
E: es oci 
3 z 10 
E E 
3 3 
O O 10 ges : supe = 
10-4 107? 107? 107! 109 
At (rad) 
a a 
E E 
€ 1054 L 1053 
5 5 
GH GH 
O 1011 O 1014 
z g 
à a 
=I 034 =I 03 4 
Z E 
2] 2J 
£ 10 gw 
E 0^3 E 0^ 3 
d mi 
O 10°! © 1094 
107? 107? 107! 10° 10-3 107? 107! 10° 
Ay (rad) AA/A 


Figure 14: Comparison of SNR and parameter estimation error for the 2L with 15 km arms at 45°, 
with the two different ASDs. For comparison, we also show the 10 km triangle at full sensitivity, and 


the forecast for LVKI O5. 
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Figure 15: Redshift distribution of BNS detected with SNR > 30 (left column), or relative error 
on the luminosity distance Adr/dr < 0.2 (central column), or with sky location AQ < 100 deg? 
(right column) for various detector configurations and sensitivity curves. The upper row shows the 
results for the six considered geometries, all with their best sensitivity, the central row for the 10 km 
triangle with the two considered ASDs, and the bottom row for the 2L with 15 km arms at 45? and 
the two considered ASDs. In each panel, we also show, for reference, the redshift distribution of the 
BNS population used (black solid line) and the events detected in the various configurations with 


SNR > 8 (dotted lines). 
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Figure 16: As in Fig. 10, for a gaussian mass distribution of the two component neutron stars. 
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3.3 ET in a network of 3G detectors 


The above analysis studied the capabilities of ET alone (whether consisting of a single- 
site triangle or of two L-shaped detectors on widely separated sites), comparing the results 
obtained with different possible geometries and sensitivity curves. We now study how the 
different geometries for ET (triangle or 2L) perform when they are included in a network 
of 3G detectors which includes Cosmic Explorer. The reference CE configuration currently 
consists of two L-shaped detectors (on the Earth's surface rather than underground), one 
with 40 km arms and one with 20 km arms [14]. We will refer to this configuration as 
‘2CE’ and we will study how different choices for the ET geometry affect the performance 
of a network made by ET and 2CE. Given the current uncertainties, it is also instructive 
to investigate what happens when ET (in a triangular or 2L geometry) is part of a network 
with a single CE detector, which we take to be the 40 km one (‘1CE’). In order to avoid a 
proliferation of plots, we limit here to three options for ET: the 10 km triangle, and the 15 km 
2L configurations, either with parallel arms or with arms at 45°. Therefore, we consider the 
following networks: 


1. ET as a triangle with 10 km arms, together with 2CE (204-40 km) 

2. ET as a 2L of 15 km and parallel arms, together with 2CE (202-40 km) 
3. ET as a 2L of 15 km and arms at 45°, together with 2CE (20+40 km) 
4. ET as a triangle with 10 km arms, together with 1CE (40 km) 

5. ET as a 2L of 15 km and parallel arms, together with 1CE (40 km) 


6. ET as a 2L of 15 km and arms at 45°, together with 1CE (40 km) 


In particular, the last two options correspond to a global network of three L-shaped 
detectors, two of 15 km and with the ET ASD, and one of 40 km and the CE ASD. In all 
these configurations, for ET we will consider only the full sensitivity, corresponding to the 
ASD that we have labelled as ‘HFLF-cryo’ and, for CE, we use the latest publicly available 
official sensitivity curves.!^ 

Fig. 17 shows the SNR distribution and the errors on parameter estimation for BBHs 
for the six networks considered, while Fig. 18 shows the corresponding results for BNSs. 
From Fig. 17, to be compared with the green, blue and orange lines in Fig. 4 referring to 
ET alone, we see that, as expected, the overall performance of the network is significantly 
better than that of ET alone for angular localization and distance determination. As an 
example, the BBH/yr localized to better than 1 deg? raise from a few tens (depending on 
the ET geometry) to a few times 10?. On the other parameters, the gain from adding one 
or two CE detectors results typically in an increase by a factor of order 2 of the number of 


™The most recent PSDs of CE can be found at https: //dcc.cosmicexplorer.org/CE-T2000017/public. 
We used the locations and orientations reported in rows C and N of Table III of [30]. In particular, the 40 km 
detector is placed in Idaho and the 20 km one in New Mexico. These orientations are such that the relative 
angle between the two CE detectors, measured with respect to the great circle that joins them, is about 56.0°. 
When ET is taken to be in the 2L configuration, with our choice of orientations the relative angle between the 
Sardinia detector and the CE-Idaho is about 191.2? and that between the Sardinia detector and the CE-New 
Mexico one is about 237.6°. As in the discussion in footnote 4, if a 2L configuration should be retained for ET, 
a careful study of the optimization of the relative orientations of the ET+2CE detector network will become 
necessary. 
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events with accuracy better than a given value of the chosen parameter; e.g., in our specific 
realization, for the symmetric mass ratio 7, the number of events per year with An < 10-9 
raises from 559 for ET alone in the 10 km triangle configuration, to about 922 in a network 
with 1CE and 1018 with 2CE; similar results hold for the other parameters. 

As we see from Tables 39, 40 and 41 in App. C (factors of 2 are difficult to appreciate 
on the logarithmic scale of Fig. 17), in a network with a 40km CE, for BBHs the different 
geometries chosen for ET induce overall differences in the results at the level of factors ~ 1.5 
or smaller. As an example, the number of BBH/yr with dr, measured better than 1% raises 
from 2901 for the 10 km triangle--1CE, to 4301 for 2L-15km-45°+1CE, while the number of 
BBH/yr localized to better than 10 deg? raises from 3.0 x 10* to 3.6 x 104. 


For BNS, the effect of the ET geometry in a network with 1 or 2CE can be seen from 
Fig. 18, and especially from Tables 42, 43 and 44. For instance, the number of BNS/yr 
with SNR > 150 raises from 87 for the 10 km triangle+1CE, to 116 for 2L-15km-45°+1CE, 
while those with dr, measured better than 1096 increases by about a factor of 2, from 4100 
for the 10 km triangle+1CE to 7949 for 2L-15km-45°+1CE. Similar differences appear for 
the chirp mass, symmetric mass ratio and tidal deformability. For instance, the number of 
BNS/yr with tidal deformability measured to better than 10% raises from 2669 for the 10 km 
triangle+1CE to 4753 for 2L-15km-45°+1CE, and from 3847 for the 10 km triangle+2CE to 
6319 for 2L-15km-45°+2CE. 

So, among the three geometries considered here, the one that gives consistently the best 
results is, again, the 2L with 15 km arms at 45?, with typical improvement with respect 
to the 10 km triangle of about a factor of ~ (1.5 — 2). The configuration 2L-15km-0°, 
when in a network with 1CE, gives similar performances to 2L-15km-45? for masses or tidal 
deformability, but is worse by a factor ~ 2.5 for angular localization and ~ 4 for luminosity 
distance. For instance, the number of BNS/yr with dr, measured better than 1096 is 2079 for 
2L-15km-0?--1CE and 7949 for 2L-15km-45°+1CE. In a network with 2CE the differences 
become smaller. 

The overall conclusion is that the difference between the performances of the ET geome- 
tries remains visible, at the level of factors ~ (1.5 — 2) for BNSs, and somewhat smaller for 
BBHs, also when ET is in a network with 1 or 2 CE. 
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Figure 17: Comparison of SNR and parameter estimation error for BBHs, for the six 3G detector 


networks considered. 
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Figure 18: Comparison of SNR and parameter estimation error for BNSs, for the six 3G detector 
networks considered. 
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Figure 19: Redshift distribution of detected BBHs with SNR > 100 (left column), or relative error 
z Y , ; 275 
on the luminosity distance Adr/dry < 0.01 (central column), or sky location AQ < 1 deg* (right 
column), for the six 3G the detector networks considered. In each panel, we also show, for reference, 
the redshift distribution of the BBH population used (black solid line), and of the events detected in 
the various configurations with SNR > 8 (dotted lines, graphically almost indistinguishable from the 
full population on this scale). 
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Figure 20: Redshift distribution of detected BNS with SNR > 30 (left column), or relative error 
on the luminosity distance Adr/dr < 0.05 (central column), or sky location AQ < 5 deg? (right 
column) for the six 3G the detector networks considered. In each panel, we also show, for reference, 
the redshift distribution of the BNS population used (black solid line), and of the events detected in 
the various configurations with SNR > 8 (dotted lines). 
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4  Multi-messenger astrophysics 


In this section we study the impact of the different detector geometries and different ASDs 
on the multi-messenger capabilities of ET operating in synergy with electromagnetic (EM) 
observatories. ET will observe together with a new generation of innovative EM observatories, 
such as CTA [56], Athena [57], the Vera Rubin Observatory [58], JWST [59], ELT [60], 
SKA [61] and with the possible mission concepts UVEX [62], THESEUS [63], HERMES [64], 
GRINTA, ASTROGAM [65] and AMEGO [66]. These observatories will probe the formation, 
evolution, and physics of BNS systems in connection with kilonovae (KNe) and short gamma- 
ray bursts (GRBs) along the star formation and chemical evolution of the Universe. The 
multi-messenger studies will tremendously benefit from the larger sample of joint GW/EM 
detections and, in the closer Universe, from the greatly improved parameter estimates of ET 
with respect to the current detectors. 

Taking into account the sky-localization capability of ET and the fact that the KN 
emission is intrinsically fainter than that of GRBs, future multi-messenger astronomy will 
differ at closer redshifts (« 0.3 — 0.4), where it will be possible to detect KN emission using 
the wide-field telescopes, such as the Vera Rubin Observatory, from the distant Universe 
where the detectable counterparts at large redshift will be almost uniquely observed in the 
high-energy band through the beamed emission of GRBs. While the joint GW/KN detections 
will enable unprecedented studies of the enrichment of the Universe with heavy elements, of 
nuclear physics to constrain the equation of state (EoS) of NSs and cosmology estimating 
the expansion rate of the Universe, the joint GW/GRBs detections will unveil the structure 
and properties of relativistic jets, the emission mechanism of short GRBs, the role of merger 
remnants such as magnetars, and will make it possible to estimate cosmological parameters 
and test modified gravity at the cosmological scale. 

The main parameters determining the joint GW/EM detections, and thus affecting the 
multi-messenger performance of ET, are the ability to localize the source and the volume of 
the Universe up to which ET will be able to detect the source in terms of achieved redshifts. 
Another important aspect of third-generation GW detectors is the ability to detect BNSs 
before the merger and send pre-merger alerts with a good estimate of the sky localization 
of the source. This is critical for detecting prompt and early emissions in EM bands where 
all-sky monitors are not operating. 

Here, we examine the sky-localization capabilities as a function of redshift and the pre- 
merger alert scenarios, and we evaluate the joint detections for ET observation together with 
examples of high-energy satellites and optical wide-FoV telescopes. Differently from the other 
sections, in this section, we use an SNR detection threshold of 8 taking into account that the 
presence of a counterpart increases the significance of the source detection. 


4.1 BNS sky-localization and pre-merger alerts 


The sky-localization capability of GW detectors has a major impact in the efficiency of the 
search for a counterpart determining the number of pointings of the EM instruments to cover 
the GW signal location, the corresponding amount of observational time to be used and the 
search sensitivity, and the efficiency to select and remove contaminating transients. Starting 
from the BNS population described in Section 3, GWFISH [31] is used to evaluate the localiza- 
tion capability and build Table 1, which gives the expected number of detections (SNR > 8) 
per year with sky-localization uncertainty (given as 90% c.l.) smaller than a threshold of 10, 
40, 100, 1000 deg? for the different ET configurations (we do not consider the 2L aligned 
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Figure 21: Redshift distribution of the sky-localization uncertainty (given as 9096 credible region) 
of randomly oriented BNS systems as a function of redshift for four detector geometries: the 10 km 
triangle (A 10 km), the 15 km triangle (A 15km), the 2L with 15 km misaligned arms (2L 15 km), 
and the 2L with 20 km misaligned arms (2L 20km). The plots on left and central columns show 
the full (HFLF cryo) sensitivity detectors. In comparison, the plots on the right columns show the 
performance of the detectors operating only with the HF interferometers. The absolute numbers are 
relative to one year of observation and assuming a duty cycle of 0.85 as described in the text. 


configurations because as shown in Section 3.2 they yield worse localization capabilities with 
respect to the misaligned configurations). Figure 21 shows the redshift distribution of the 
sky-localization uncertainty for all the detected BNS events. The distribution is given as a 
function of redshift for four detector geometries; 10 km triangle, 15 km triangle, 2L with 
15 km misaligned arms (i.e. arms at a relative orientation of 45?), and 2L with 20 km mis- 
aligned arms. The different colours show the detections localized within a certain threshold 
on the sky-localization uncertainty. Comparing the plots on the left and central columns of 
the figure, which show the full (HFLF cryo) sensitivity detectors, it emerges that the 2L with 
15 km misaligned arms performs better than the 10 km triangle and is comparable to 15 km 
triangle (in agreement with the results shown in the panel for AQ9os; in Fig. 10). Looking 
at the events localized with AQggy, < 10deg? (100 deg?), the number of detection for the 
2L with 15 km misaligned arms increases by factor 2 reaching z of 0.3 (1.2) with respect to 
the 10 km triangle, whose well-localized detections reach z — 0.2(0.7). The 2L with 20 km 
misaligned arms increases the number of well-localized BNSs by a factor 4 with respect to 
the 10 km triangle reaching z — 0.4 (1.6) with events localized within 10 deg?(100 deg?). Ta- 
ble 1 shows also the expected number of detections per year within certain sky-localization 
uncertainty selecting events with a viewing angle, O,,'? smaller than 15°. A fraction of 
these events is expected to produce detectable high-energy emission powered by the GRB 
relativistic jet, assumed to be perpendicular to the orbital plane. Also for on-axis events, the 
2L with 15 km misaligned arms performs better than the 10 km triangle and is comparable to 
the 15 km triangle. 


The viewing angle is defined as ©, = 1 for 0 € 1 < 90? and ©, = |, — 180°| for 90? <u < 180? 
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Full (HFLF cryo) sensitivity detectors 


Aflos (deg?) All orientation BNSs BNSs with viewing angle O, < 15° 
A10 | A15 | 2L 15 | 2L 20 | A10 | A15 | 2L 15 2L 20 

10 11 27 24 45 0 1 2 5 

40 T8 215 162 350 8 22 20 33 

100 280 | 764 644 1282 26 74 68 133 

1000 2112 | 5441 | 7478 | 13482 | 272 | 632 | 1045 1725 


Table 1: Expected number of detection (SNR > 8) per year with sky-localization uncertainty 
AQooy%(deg”) smaller than the threshold indicated in the first column. While the columns 2-5 give 
the detections for BNS systems randomly oriented, the columns 5-9 give the detection of on-axis 
events, whose viewing angle is smaller than 15°. The numbers are relative to one year of observation 
assuming a duty cycle of 0.85 as described in the text. 


HF sensitivity detectors 


AQooy% (deg?) All orientation BNSs BNSs with viewing angle ©, < 15° 
A10 | A15 | 2L 15 | 2L 20 | A10 | A15 | 2L 15 2L 20 

10 0 1 5 5 0 0 2 2 

40 4 10 20 4T 0 5 6 17 

100 14 53 76 144 7 33 35 64 

1000 145 | 548 | 1662 | 3378 80 | 336 672 1302 


Table 2: Same as Table 1 but considering the detectors operating with only the HF interferometers. 


Losing the low-frequency interferometers, the HF GW detectors detect a significantly 
smaller number of well-localized events. However, as shown comparing Table 1 and Table 2,6 
the decrease of detections of well-localized events is more severe for the triangle configurations 
than for 2L misaligned. The right column of Fig. 21 shows the redshift distribution of the 
sky-localization uncertainty for the HF 15 km triangle and 2L with 20 km misaligned arms. 
'The comparison with the corresponding configurations HFLF cryo shows that a large fraction 
of well-localized events is already missed at small redshifts. Focusing on the on-axis events, 
the use of only HF interferometers decreases the number of well-localized events but in a 
percentage of a smaller fraction with respect to the events randomly oriented. While the HF 
2L with 15 km and 20 km misaligned arms localize worse than the HFLF-cryo 10 km triangle 
for randomly oriented systems, for on-axis events, the HF 2L with 15 km misaligned arms 
localization capability is comparable to the HFLF-cryo 10 km triangle and the HF 2L with 
20 km better than the HFLF-cryo 10 km triangle. 

The detection of prompt/early multi-wavelength emission from BNS mergers is critical 
to probe the central engine of GRBs, particularly to understand the jet composition, the 
particle acceleration mechanism, the radiation and energy dissipation mechanisms [see e.g. 
67]. Regarding the kilonova emission, it has been shown that the early phase emission is 
particularly sensitive to the structure of the outer sub-relativistic ejecta (see, e.g., [68]) and 


'6The different numbers, comparing Tables 1 and 2 with Table 43 in App. C, reside in the different SNR 
threshold, assumed to be 8 for multi-messenger studies and 12 for the metrics of Section 3 and App. C. The 
larger difference on the HF-only estimates is also related to the different approach of GWFISH (used for the 
Tables 1 and 2) which regularizes the close-to-singular matrices [31] instead of discarding the ill-conditioned 
Fisher matrix as GWFAST (used for the Table 43) [17, 32]. As described in the footnote 9, for the HF-only case 
the percentage of discarded events approaches 1096. 


— 41- 


Full (HFLF cryo) sensitivity detectors 


AQgox, All orientation BNSs BNSs with ©, < 15? 
Configuration 

[deg?] 30 min | 10 min | 1 min | 30 min | 10 min | 1 min 

10 0 1 5 0 0 0 

100 10 39 113 2 8 20 
Al0km 

1000 85 293 819 10 34 132 

All detected 905 4343 | 23597 81 393 | 2312 

10 1 9 11 0 1 1 

100 41 109 281 6 14 36 
Al15km 

1000 279 806 | 2007 33 102 295 


All detected 2489 | 11303 | 48127 221 1009 | 4024 


10 0 1 8 0 0 0 
2L 15 km misaligned oe a us as ! d 
1000 194 565 | 1399 23 73| 199 
All detected | 2172 | 9598 | 39499 198 863 | 3432 
10 2 4 15 1 1 2 
2L 20 km misaligned | 100 39 118 288 7 19 4T 
1000 403 | 1040 | 2427 AT 128) 346 
All detected | 4125 | 17294 | 56611 363 | 1588 | 4377 


Table 3: Number of BNS mergers per year detected (SNR > 8) before the merger within z = 1.5 for 
the different full (HFLF cryo) sensitivity ET configurations. Three pre-merger scenarios (30, 10, and 
1 minute(s) before the merger) are shown in different columns. For each detector configuration, the 
rows give the number of detections with sky-localization (90% c.1.) within 10, 100, 1000 deg?, and all 
the detected sources. The numbers of GW detections per year take into account the ET duty cycle 
as described in the text. 


can give rise to early UV emission detectable by wide-field satellites such as ULTRASAT 
[69], and mission concepts as UVEX [62] and Dorado [70] which can benefit of pre-merger 
alerts. Detecting a BNS signal before the merger makes it possible to eventually detect 
electromagnetic emission precursors. It makes it possible also to image the sky-localization 
before the merger that, especially for optical observations, is extremely useful to remove 
contaminating transients present before the merger. Considering all the benefits of pre- 
merger alerts for multi-messenger astronomy, here we explore the ET capabilities to detect 
BNSs and give a sky-localization before the merger. 


Table 3 shows the number of BNS mergers per year detected (SNR > 8) before the 
merger within z — 1.5, for the different full (HFLF cryo) sensitivity ET configurations. We 
consider three pre-merger scenarios; 30, 10 and 1 minute(s) before the merger. For each ET 
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HF sensitivity detectors 


dono AN oox% All orientation BNSs BNSs with 0, < 15° 

[deg?] 30 min | 10 min | 1 min | 30 min | 10 min | 1 min 

100 0 0 0 0 0 0 

A10km 1000 0 0 4 0 0 1 

All detected 0 3 317 0 0 26 

100 0 0 2 0 0 0 

Aldkm 1000 0 0 10 0 0 4 

All detected 2 8 891 0 1 84 

100 0 0 0 0 0 0 

2L 15 km misaligned | 1000 0 0 7 0 0 3 
All detected 0 T 743 0 1 69 

100 0 0 3 0 0 0 

2L 20 km misaligned | 1000 0 0 13 0 0 6 
All detected 2 11 1535 0 1 146 


Table 4: Same as Table 3 but for the HF sensitivity ET. 


configuration, we give the number of detection with sky-localization (90% c.l.) within 10, 
100, 1000 deg?, and all the detected sources. We show the results for the BNSs randomly 
oriented of our population (columns 3-5) and select among them the BNSs with viewing angles 
smaller than 15° (columns 6-8). The on-axis detections are particularly important to detect 
the prompt/early beamed emission associated with relativistic jets, for example the very high 
energy prompt emission never detected so far from GRBs (see [71] for the perspectives with 
the Cherenkov Telescope Array [72]). Focusing on well-localized events detected pre-merger 
with a sky-localization smaller than 100 deg?, we highlight that the 15 km triangle turns out 
to perform better than the 10 km triangle and the 2L with 15 km misaligned arms, and its 
performance is similar to the 2L with 20 km misaligned arms. The 2L with 15 km misaligned 
arms performs better than the 10 km triangle. The results are similar for on-axis events. 

Table 4 shows the dramatic decrease of pre-merger alerts without low-frequency. For 
all the ET configurations, there are no localized pre-merger detections, except a few events 
with sky localization smaller than 1000 deg? per year one minute before the merger (a few 
< 100 deg? for 2L20km). 


4.2 Gamma-ray bursts: joint GW and high-energy detections 


Following the approach described in [18], we evaluate the expected detections of prompt 
and afterglow emission of short GRBs associated with BNS mergers. We explore the joint 
detection capabilities of the different configurations of ET operating with gamma-ray and 
X-ray satellites. As reference instruments, we use some examples of current and upcoming 
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INSTRUMENT E. d Bis Qa | FoV/Am | loc. ace. Status 
Fermi-GBM 0.01 - 25 0.5 0.75 5 deg Operating mission 
GECAM 0.006-5 | 2x 10-8(*) 1.0 1 deg Operating mission 
HERMES 0.05 - 0.3 0.2 1.0 1 deg Mission concept 
Pathfinder next few yrs 
GRINTA-TED 0.02-10 0.45 0.64 5 deg(**) Mission concept 
ASTROGAM 0.03-200 0.29 0.27 « 1.8 deg Mission concept 
THESEUS-XGIS | 0.002 - 10 | 3x 1075(*) 0.16 « 15 arcmin Mission concept 
Possible launch 2037 


(*) expressed in ph erg cem? s^! 


(**) 6596(4296) of the TED/GW joint detections are detectable by the Hard X-ray Imager HXI on board of 
GRINTA by repointing HXI on the TED detection in 60s (5 min). HXI is expected to localize at order of 30 
arcSection 


Table 5: Instruments characteristics. 


satellites and mission concepts. We analyze different observational strategies: survey mode 
and target pointing. 


4.2.1 Prompt emission 


Starting from our astrophysically-motivated population of BNS mergers we model the high- 
energy signals associated with the mergers making the BNS population able to reproduce 
the rate and the properties of all the short GRBs observed so far (see [18] for details). For 
the fraction of BNS mergers in our population that are expected to produce a jet (about 
20% of the BNS see [18]), we inject both the GW and EM signals and recover them based 
on the detection efficiency of the GW detector and EM satellites, respectively. We define a 
joint detection if the signal is above the detection threshold simultaneously in both the GW 
and EM observatories. To properly compare the capabilities of different ET configurations 
and avoid differences due to the uncertainties in the EM emission, for the whole set of GW 
simulations we associate to each BNS population the same realization of the EM signal 
derived from the MCMC. 

For the detection of the prompt emission, we consider the following y-ray instruments: 
the Gamma-ray Burst Monitor (GBM) on board of Fermi [73], the Gravitational wave high- 
energy Electromagnetic Counterpart All-sky Monitor [GECAM; 74], the High Energy Rapid 
Modular Ensemble of Satellites [HERMES; 64], the Transient Event Detector (TED) on board 
of the Gamma-Ray INternational Transient Array Observatory (GRINTA), ASTROGAM 
[65], and the X/Gamma-ray Imaging Spectrometer (XGIS) on board of the Transient High- 
Energy Sky and Early Universe Surveyor [THESEUS; 63]. The properties of these instru- 
ments are summarized in Table 5. 

Table 6 shows the numbers of joint GW+-ray detections during one year of observation 
for the different y-ray satellites operating in survey mode together with different ET full 
sensitivity configurations. These numbers go from ten to one hundred depending on the 
satellite and ET configurations. We highlight that, while some satellites maximize the number 
of detections (which is useful for population studies) sacrificing the localization capability, 
others with a smaller number of detections give the sky-localization necessary to drive the 
ground-based and space-borne follow-up. This is critical for identifying the host galaxy, 
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Full (HFLF cryo) sensitivity detectors 


Instrument A10 | Aid | 2L 15 | 2L20 | A10 Ais | 2L15 | 2L 20 
Fermi-GBM 3173 | 4211 | 39910 | 4473 | G1 | 8317% | 7943% | 8914% 
GECAM 61133 morer | elt | 96457 SP ZU TE% 807196 
HERMES S | 120430 | 117437 | 132134 555 BEA Tu UA 851596 
GRINTA-TED | 77431 | 10773) | 98731 | 114734 | 577 iy 79* foi 747 9 E% 
ASTROGAM [IR LOI | ae 2370 TA 307% UA 367% 
THESPUS-XGIS | 10% | 1373 | 1973 | 1543 |5749% 79539 aiy si% 


Table 6: Columns 2-5 give the numbers of joint GW-4-4-ray detections during one year of observation 
for different combinations of y-ray instruments operating in survey mode together with different ET 
full sensitivity configurations. The absolute numbers do not assume duty cycle for the satellites. 
Columns 6-9 give the fraction of detected short GRBs which will have a GW counterparts. 


evaluating a redshift, and completely characterising the source and its environment. An 
important piece of information is given in columns 6-9, where we show what is the fraction 
of y-ray bursts detected by each satellite which will have an associated GW signal. Here, 
we find that these fractions go from 5196 — 61% for the 10 km triangle, 74% — 83% for 
the 15 km triangle, 7096 — 7996 for the 2L with 15 km misaligned arms, 8096 — 8996 for 
the 2L with 20 km misaligned arms. The ET 10 km triangle will make possible remarkable 
results for GRB science by detecting GW signals for a large fraction of short GRBs observed 
by y-ray satellites, and it represents an impressive step forward from the LIGO, Virgo and 
KAGRA network, whose BNS horizon strongly limits the ability to detect events with the 
relativistic jet aligned with the line-of-sight (see e.g. [75, 76]). Then, the percentage of GRBs 
with a GW counterpart significantly increases going from the 10 km triangle to 15 km and 
20 km configurations. The 15 km triangle is marginally better than the 2L with 15 km 
misaligned arms. Figure 22 takes as an example HERMES (full constellation) and shows 
the joint detections during one year. The plot shows that the most significant improvement 
is from the 10 km triangle to a 15 km configuration. The 15 and 20 km configurations are 
able to increase the number of joint detection at redshift larger than 0.9 with respect to the 
10 km triangle. This is particularly important to estimate cosmological parameters or to test 
modified gravity at cosmological scales. 

Considering the GW detectors operating with only the high-frequency instrument (see 
Table 7), we find that the percentage of detected short GRBs with an associated GW signal 
significantly decreases for each configuration. The percentages become 29% — 39% for the 
10 km triangle, 54% — 64% for the 15 km triangle, 49% — 60% for the 2L with 15 km 
misaligned arms, 66% — 76% for the 2L with 20 km misaligned arms. In terms of percentage 
of short GRB with a GW signal, the performance of the 15 km triangle and 2L15km HF-only 
detectors is comparable to the 10 km triangle full sensitivity. This is also shown in the left 
plot of Fig. 23. The right plot of Fig. 23 shows that 2L20km HF results to be better than 
the 10 km triangle full sensitivity, but again going to lower frequency significantly increase 
the 2L20km performance making possible to detect the GW signals associated with a large 
fraction (85%) of short GRBs detected by HERMES. 


4.2.2 Afterglow: survey and pointing modes 


This section explores the capability of detecting X-ray short GRB afterglows associated with 
GWs. Using the approach described in [18], we model the afterglow taking into account both 
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HF sensitivity detectors 


Instrument A10 | A15 [2L 15| 2L20 | A10 A15 | 2L15 | 2L20 
Fermi-GBM 2043 | 3348 |2913 | 381 | 39131% | 64117% | 60117% | 7613% 
GECAM 3p ears betes | 77436 | 2915% | 5415% 198% 667 3% 
HERMES 52421 | 91430 | 8342 | 107%% | 33 K 58110% RU 71% 
GRINTA-TED |4617 | 80731 | 74738 | 94733 347% 6117% | 5517% | 7213% 
ASTROGAM 1246 | 1977 | iets | 2313 |3711% 6299; 5713% 7439 9i 
THESEUS-XGIS| 672 | ad*s.| 973 | 198 TU UA 59* 10% 547107; 7143% 


Table 7: Same as 6 but considering the GW detectors operating with only the HF interferometers. 
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Figure 22: Histogram of the joint detection of ET and HERMES observing in survey mode during 
one year. The solid lines of different colors represent the different configurations of ET full sensitivity. 
The dashed line show the prompt emission of short y-ray bursts expected to be detected by HERMES 
during one year of observations. 
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Figure 23: Histogram of the joint detection of ET and HERMES observing in survey mode during 
one year. The plot shows the comparison among ET full sensitivity and ET HF for Al0km and 
2L15km (left plot), and 2L20km (right plot). The dashed line show the prompt emission of short 
y-ray bursts expected to be detected by HERMES during one year of observations. 


the forward shock and the high-latitude emission [77, 78] and we evaluate the joint GW /X-ray 
detections by considering two observational strategies; the satellite observing in survey mode 
or the satellite slewing to point well-localized events. As an example of performance of X-ray 
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Full (HFLF cryo) sensitivity detectors 

Instrument A10 | A15 | 2L 15 | 2L 20 
THESEUS-SXI survey toe | as lo | 127 
THESEUS-(SXI+XGIS) survey | 2178 | 2178 | 2077 | 2147 

HF sensitivity detectors 
Instrument A10 | A15 | 2L 15 | 2L 20 
THESEUS-SXI survey gre [da | gt ay? 
THESEUS-(SXI+XGIS) survey | 1678 | 19T8 | 19t% | 2178 


Table 8: Numbers of joint GW--X-ray detections during one year of observation for THESEUS 
operating in survey mode considering the instrument SXI and the combination of SXI and XGIS. 
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Figure 24: Redshift distribution of joint GW+EM detections with THESEUS-(SXI+XGIS) in sur- 
vey mode, where we distinguish between on-axis and off-axis cases. The ET configuration considered 
here is 2L 20 km HFLF cryo. 


wide FoV satellites, we consider the Soft X-ray Imager (SXI) planned to be on board the 
THESEUS mission [63]. SXI is a wide field of view instrument able to cover 0.5 steradians 
in one observation. It has a localization accuracy of 1-2 arcmin. Since also XGIS has good 
sensitivity down to 2 keV in a larger FoV of 2 steradians, We also consider the combination 
of SXI and XGIS. 

'Tables 8 show the numbers of X-ray afterglow detected in survey mode by SXI and the 
combination of SXI and XGIS on board of THESEUS which will have an associated GW sig- 
nal. While the top Table shows the joint detection with the ET full sensitivity configurations, 
the bottom Table shows the ones with ET HF. The number of joint detections is around 10 
for SXI (20 for SXI+XGIS) per year independently of the arms lengths and the geometries, 
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Instrument A10 cryo | 2L 20 km cryo | A10 HF | 2L 20 km HF 
THESEUS-SXI pointing 6*1 I1 p T 


Table 9: Numbers of joint GW+X-ray detections during one year of observation for THESEUS-SXI 
operating in pointing mode. Only BNS localised better than 100 deg? are followed-up. 


and the number remains almost the same also without accessing low-frequencies. Figure 24 
shows a histogram of the joint GW-+afterglow detections by THESEUS-(SXI+XGIS) in sur- 
vey mode operating with full sensitivity 2L with 20 km misaligned arms as a function of 
redshift. The figure shows that the majority of the joint detections are within z = 1. This 
explains the reason why both arm-length and accessible frequency do not change the joint 
GW/X-ray afterglow detection numbers; the majority of the afterglows are detectable up to 
a redshift where there is no significant difference among the GW detection efficiency. The 
figure also shows that detecting the X-ray afterglow enables to observe events with a jet not 
aligned to the line of sight. 

For the pointing strategy, we select events with a sky-localization uncertainty smaller 
than 100 deg’, which is well-contained inside the SXI FoV, and we assume that SXI is able to 
be on target 100 s after the merger. This is a very optimistic scenario because 100 s should 
include the time to transmit the alert, respond to the trigger and re-point the instrument 
to the source position. Table 9 shows that a few detections are possible and going from the 
10 km triangle to the full sensitivity 2L 20 km increases of a factor of about 2 the detections. 
As described in [18], there are two possible issues for the pointing strategy; being extremely 
fast to point to the source, and prioritising the events to be followed. No X-ray detection 
is possible being on source 1 hr after the merger with respect to 100 s because the X-ray 
afterglow emission is decaying fast. To detect a few events (see Table 9) one needs to optimize 
the event selection among hundreds of events (see Table 1) to be followed-up, to avoid to 
loose observational time. Selecting the sources localized better than 1000 deg?, the number 
of joint GW4-X-ray detections increases by a factor of 10 but the selection of the event to 
be followed becomes more complicated. As shown in Table 1 the number of sources detected 
with sky-localization 1000 deg? are thousands. Also prioritizing the events on the basis of the 
viewing angle by excluding all the sources edge-on (for which the viewing angle estimate is 
more precise), the number of events to be followed up remains to be around several hundreds 
(for the 10 km triangle) to thausands (for the 2L 20 km). While the 10 km triangle gives 
fewer joint detections with respect to 2L 20 km full sensitivity, the trigger selection is easier 
because it is among a smaller number of events. The HF-only 2L with 20 km misaligned 
arms gives a number of joint detection comparable to the 10 km triangle. 

In summary, for the detection of the X-ray afterglow emission associated with the GW 
BNS signals there are no significant differences for the different configurations of ET. 


4.3 Kilonovae: joint GW and optical detections 


This section explores the perspectives for the detection of KNe. The KN emission is not 
beamed as GRBs and is expected from all viewing angles. We consider as reference ob- 
servatory, the Vera Rubin Observatory (VRO). VRO is the innovative 8.4-meter telescope 
characterized by a wide field (9.6 deg?) camera containing over three billion pixels of solid- 
state detectors [58]. Its compact shape enables to point and slew through large sky-region 
extremely fast. It represents the ideal instrument to observe the GW sky localizations ob- 
taining deep observations of extremely high quality. In order to evaluate the number of 
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Figure 25: Light-curves for GW170817-like signals associated with BNS detected with a sky- 
localization smaller than 40 deg? by each of the full (HFLF cryo) sensitivity configurations (four top 
plot) and by each of the HF sensitivity configurations (four bottom plot). The black lines mark the 
(50 point-source depth) limiting magnitudes corresponding to 600 s of exposure for each pointing in 
the g band (solid black line) and the i band (dashed black line). 
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possible joint optical/GW detections, we model the kilonova emission as in [79] taking the 
best-fit parameters obtained to reproduce the multi-filter optical observations of the kilonova 
AT2017gfo associated with GW170817. Our model includes cosmological and k-correction. 
We select all the BNS mergers detected with a sky-localization smaller than a threshold and 
we associate to each of them a GW17087-like kilonova taking into account the source dis- 
tance and the viewing angle (our modeling starting from GW170817 reproduces the angular 
dependence of the expected emission). We assume three threshold on the sky-localization 
by selecting all the events with sky-localization < 20deg?, < 40deg?,and < 100deg?. We 
consider a Target of Opportunity strategy similar to the ones described in [80-82]; we use 
observations in two filters (g and i) the first night after the merger and we repeat the two 
filter observations during the second night after the merger. We consider a detection when 
the kilonova emission is detected at 5o the first and second night in both filters. Having 
observations in at least two filters is important to reduce the number of contaminating tran- 
sients and identify the kilonova associated with the GW signal on the basis of the color 
evolution. The problem of contaminating transients is solved in real observations also using 
spectroscopy able to characterize the source, but here for the configurations comparison, this 
is not taken into account. Our counts take into account the visibility of the VRO. 

Figure 25 shows the light-curves for GW170817-like signals associated with our popu- 
lation of BNSs detected with a sky-localization smaller than 40deg? by the full (HFLF cryo) 
sensitivity configurations and by the HF sensitivity configurations. The plots show the dif- 
ferent numbers of events selected to be followed up for the different configurations. It is 
immediately clear the significant increase in the number of targets to be followed going from 


Full (HFLF cryo) sensitivity detectors 


Configuration Naw,vro VRO Naw,vRO VRO Naw,vRO VRO 
Q « 20deg? time Q < 40 deg? time Q < 100deg? | time 
A10 14 (14) 1.196 (3.396) 36 (39) 5.196 (15%) 96 4096 
A15 38 (42) 3.396 (9.8%) 84 (101) 14.2% (42%) 163 > 100% 
2L 15 28 (28) 2.2% (6.5%) 62 (77) 10.6% (31%) 189 93% 
2L 20 55 (64) 5% (14.9%) 115 (152) 23.1% (68%) 324 > 100% 


Table 10: Numbers of expected kilonovae detected by the VRO selecting the sources detected by 
the different configurations of ET with sky localization smaller than 20 deg? (column 2), 40 deg? 
(column 4) and 100 deg? (column 6). Columns 3, 5, and 7 give the percentage of the VRO time 
necessary to follow up all the sources with sky-localization smaller than the above thresholds and 
within the VRO visibility sky area. The observational time for each event is obtained considering 
600 s for each pointing and a number of the pointing corresponding to the ratio between the sky- 
localization uncertainty and the VRO FoV plus one pointing to be more conservative. We consider 
observations in two filter (g and i) the first night after the merger and the two filter observations 
repeated during the second night after the merger. We add 60 s to take into account filter change, 
slew time and overheads each night. The percentage of VRO time is evaluated assuming 2600 hours 
as the total amount of observational time of the VRO in one year. The expected number of detected 
kilonovae and percentage of VRO time considering 1800 s for each pointing (instead of 600 s) are 
given in parentheses. For the 100 deg? threshold, assuming for each pointing of 1800 s entails that 
the number of hours requested to cover the entire sky-localization is larger than the ones available in 
one observing night. This makes our observational strategy unfeasible, and thus we exclude this case. 
It is also shown that to follow all events with sky-localization « 100 deg?, the required observational 
time becomes prohibitively large and for A15 and 2L 20 larger than the entire VRO observational 
time. 
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HF sensitivity detectors 


Configuration Neaw,vro VRO New,vro VRO Naw,vRO VRO 
Q < 20 deg? time Q < 40 deg? time Q « 100 deg? 

A10 0 (0) 0% (0%) 2(2) |03% (0.8%) n 

A15 2 (2) 0.2% (0.5%) 3 (4) 0.7% (1.9%) 8 

2L 15 3 (4) 0.4% (1.2%) 7 (7) 1.3% (3.9%) 26 

2L 20 5 (4) 0.6% (1.6%) 15 (18) 3.1% (9.3%) 32 20.8% 


Table 11: Same as 10 but for ET detectors without the low-frequency. 


configurations with only high-frequency to the full sensitivity configurations. 

Table 10 gives the number of expected kilonovae by following up all the events within 
the VRO visibility sky-area and with a sky-localization uncertainty smaller than 20 deg?, 
40 deg? and 100 deg?, and the corresponding percentage of VRO time necessary for the 
follow-up. Enlarging the threshold on the sky-localization significantly increases the number 
of events to be followed-up and thus the number of joint detections but at the expense of 
more observational time. Also increasing the exposure from 600 s to 1800 s for each pointing 
tends to increase the number of detections but the percentage of time to be used becomes 
prohibitive. Looking at the full sensitivity detectors, the 2L with 20 km misaligned arms is 
the best performing in detecting kilonovae. It enables to detect between several tens and a 
few hundred kilonova counterparts per year. The 15 km triangle is slightly better than the 2L 
with 15 km misaligned arms giving a number of detection about 30% larger . The 15 km 
triangle is significantly better than the 10 km triangle giving about a factor 2 larger number 
of detections. 

The presence of low-frequency is critical for ET operating as single observatory to detect 
a large number of kilonovae counterparts. 'Table 11 shows the small number (a few) of 
detections per year expected with the triangle-HF configurations. This number increases to 
a few tens for the 2L-HF configurations. 

We highlight that the absolute number per year of VRO detections is affected by the 
error on the BNS local rate normalization. The astrophysical rates inferred from the ob- 
servations of the first, second and third run of observations of LIGO and Virgo give a BNS 
merger rate in the range 10 to 1700 Gpc^?yr-! [7]. Since our population corresponds to a 
local rate Ro ~ 250 Gpc ^? yr-!, the actual numbers could be up to one order of magnitude 
smaller or larger than the ones given in the Tables. 


5 Stochastic backgrounds 


The stochastic GW background (SGWB) is formed by the incoherent superposition of signals 
emitted by different GW sources in our Universe, primordial or astrophysical, at different 
redshifts, that we collect at our detector. Traditionally, we distinguish between an astro- 
physical GW background (AGWB) and a background of cosmological origin (CGWB). The 
latter are tensor modes produced by different processes in the early universe such as infla- 
tion, reheating, phase transitions, cosmic strings, or primordial black holes (see [83-85] for 
reviews). If detected, it would provide us with direct information about the very first in- 
stants of time of the evolution of our Universe. On the other hand, the AGWB is made of 


'’The scenario of following up all the events with sky-localization < 100 deg? case is not considered because 
the required observational time of VRO is prohibitive. 
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the superposition of all GWs emitted by different populations of astrophysical sources, from 
the onset of stellar activity until the present epoch. The CGWB and the AGWB are the 
gravitational counterparts of the Cosmic Microwave Background (CMB) [86] and of the Cos- 
mic Infrared background (CIB) [87] in the electromagnetic context: in the same way as the 
CIB is a foreground for the CMB, which needs to be properly characterised and subtracted 
to access the CMB and its cosmological content, astrophysical backgrounds of gravitational 
waves represent foregrounds for primordial backgrounds of gravitational radiation. Typically 
it is expected that different GW sources are characterized by a different frequency profile, 
that can be used as a tool to disentangle them [88, 89]. As recalled in App. A, a stochastic 
background is characterized by the quantity 


~ m 1 dpgw(A, f) 
Ose (f f) = Pe dln fd?à , 


(5.1) 


where dpgy(n, f)/d1n fd? is the energy density per unit logarithmic frequency and unit solid 
angle that reaches the observer from the direction n and with frequency f, see eqs. (A.2)- 
(A.4). The isotropic (normalized) energy density is defined by 


9s. f) = | PA Meld, f). (5.2) 


and gives the (normalized) energy density of the stochastic background reaching the observer 
from all directions. It is commonly assumed that it is possible to factorize the dependence 
on the frequency and that on the direction, i.e. 


Og (hi, f) = EP À). (5.8) 


Furthermore, for many cosmological backgrounds (but not, for instance, for the backgrounds 
generated by cosmological phase transitions) the scale of variation with frequency is such 
that, over the bandwidth of a ground-based or a space-borne GW detector, the frequency 
dependence can be approximated by a simple power-law E(f) = Ag (f/fret)°, where Ag is 
the amplitude at an arbitrarily chosen reference frequency fref- 

In the presence of statistical isotropy, the statistical properties of the background energy 
density can be characterised in terms of a frequency-dependent angular power spectrum as 


(Qs, (Â, f) Os (8^, f)) = V Celf) Pilà- 80), (5.4) 
£ 


where P;(ü- n’) are Legendre polynomials, functions of the angular separation between the 
two directions ñ and ñ’. The angular power spectrum and the frequency spectrum E(f) 
encode most of the statistical properties of a given background component. Recently, it has 
been shown that, similarly to CMB photons, primordial GWs are also affected by large-scale 
anisotropies (i.e., Sachs- Wolfe and Integrated Sachs-Wolfe effects) [90-93]. Typical CGWB 
components are expected to have the same level of anisotropy as the CMB: a scale-invariant 
spectrum /(/ 4- 1)C, x constant, and a level of anisotropy of the order of ~ 107? with respect 
to the monopole [91]. In contrast, extra-galactic AGWBs have a scaling given by clustering, 
resulting in (+ 1)C; « constant, and a higher level of anisotropy, at the level of ~ 107? 
with respect to the monopole [94-108] (see [109] for a recent public code to compute the 
anisotropies of the AGWB). Interestingly, both the CGWB and the AGWB anisotropies show 
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a correlation with CMB anisotropies which can be used both as a source discriminator and 
for testing systematics [110, 111]. 

While the production of tensor modes from the early universe can typically be described 
in terms of a continuous emission, varying on a time scale extremely small with respect to 
typical observation time-scales,? the AGWB in the frequency band of ET is expected to 
have a so-called popcorn-like nature, due to the discreteness of emissions in time. Indeed, in 
the Hz band, we expect the AGWB to be dominated by mergers of compact objects and, 
considering the short duration of the merger phase, events are expected to be separated 
in time and with a limited time overlap (some overlap is expected for BNS mergers). As 
a consequence, in the frequency band of ET, the angular power spectrum of the AGWB 
from mergers of compact binaries has an important popcorn component (due to the discrete 
emission in time), which adds to the clustering one (due to the discreteness of the spatial 
galaxy distributions) [97, 103, 104, 108]. Formally, the total angular power spectrum is given 
by Goer = Cy + NP?P, where the first term on the right-hand side is the contribution 
from clustering while the second component represents popcorn noise. The latter is flat 
in £-space (it is just an offset) and it is expected to cover the clustering contribution, see 
[97, 103, 104, 108]. Even if popcorn noise contains astrophysical information, it does not 
provide any information about the spatial distribution of sources. A possible way to overcome 
this problem, i.e. to extract the clustering part out of the popcorn noise threshold, is to 
consider cross-correlations between an SGWB map and galaxy distributions, see e.g. [108]. 
'The study of cross-correlations, besides being an independent observable interesting on its 
own, provides one with a powerful detection tool, as it typically has a higher SNR than the 
auto-correlation, see e.g. [96—98, 108, 112] in the context of the extra-galactic astrophysical 
background. 

Finally, kinematic anisotropies induced by a peculiar motion of the observer with respect 
to the rest frame of emission are expected to be quite important and potentially detectable 
if the corresponding monopole has a sufficiently high SNR, see e.g. [113]. Recent studies to 
extract the cosmic dipole using next-generation detectors have been developed both for the 
AGWB [113, 114] and for resolved sources [115]. 

We conclude this section with a remark. Because of the discrete emission in time, the 
AGWB in the Hz band is not irreducible: with a perfect instrument with infinite sensitivity 
(and with infinite computational power), all events would be detectable individually. As the 
sensitivity of next-generation ground-based detectors will improve more and more events will 
be detectable individually: in this context, it is interesting to study the so-called residual 
background, i.e. the background contribution that remains after having filtered out resolvable 
objects. This observable can indeed provide us with a unique way of extracting information 
on a faint and distant population, that cannot be detected with a catalogue approach. We 
will get back to this point later in this section. 


5.1 Sensitivity to isotropic stochastic backgrounds 


We now study the sensitivity, in different configurations, to a SGWB. To this goal, we com- 
pute the Power-Law integrated Sensitivity curve (PLS), whose definition is recalled in App. A, 
for the triangular and L-shaped detectors, in the configurations discussed in Section 2, so 
for the triangular configuration we consider 10 and 15-km arm, while for the L-shaped we 


'8For this reason most of the CGWB components represent continuous and stationary background compo- 
nents and they are often referred to as irreducible backgrounds. 
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Figure 26: Power-law integrated sensitivity (PLS) to QGw(/) for different ET geometries, in the full 
HFLF-cryo configuration. We set SNR;, = 1 and Tops = lyr and we include only cross-correlations 
between detectors, excluding auto-correlations. 


consider 4 configurations: 15 km and 20 km arms, aligned and misaligned (by 45 deg) de- 
tectors located one in Sardinia and one in the Netherlands. We then turn to the angular 
sensitivity for the same ET configurations, i.e. we present how well the various configurations 
can reconstruct the C;(f) (i.e., the angular power spectrum), defined in eq. (5.4). 

The result is shown in Fig. 26 for the different geometries in the full HFLF-cryo config- 
uration, and in Fig. 27 for the HF-only configuration. !? 

We observe that the triangular configurations perform better at high frequency since the 
overlap reduction function remains constant at high frequencies for co-located detectors rather 
than falling to zero. The 2L configurations do better at low frequencies, up to several tens of 
Hz, but only for co-aligned detectors (since this maximises the coherence for the stochastic 
search). The case of 45 deg 2L configurations are universally the worst. 

Fig. 28 compares the power-law sensitivities obtained from the HFLF-cryo and the HF- 
only ASDs for the 2L 15km with parallel arms (left panel) and for the 10 km triangle (right). 
Since Qew(f) is quadratic in the strain, the loss of the low-frequency sensitivity has a huge 


Recall from Section 2 that, for the 2L cases, the alignment between the two detectors has been defined 
using the relative angle a measured with respect to the north of the Sardinia site for both detectors, rather 
than with respect to the great circle passing through the two sites. Therefore the a = 0° and the a = 45° 
cases shown in the figures below do not correspond to the ideal and, respectively, the worst configurations for 
stochastic searches, but have slight offsets from it. In terms of the angle a, the ideally parallel and the perfectly 
misaligned for the configurations correspond to a = 2.51° and a = 47.51°, respectively. This is particularly 
important for the misaligned case. Indeed, a = 47.51° would correspond to essentially zero sensitivity to 
stochastic backgrounds, while we see from Fig. 26 that, for o, = 45°, we already have an interesting sensitivity 
to stochastic backgrounds, so a small offset from the perfectly misaligned configuration is already very useful 
for recovering a part of the sensitivity to stochastic searches (without basically affecting the searches for 
compact binary coalescences, as discussed in Section 2). See App. B for more details. 
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Figure 27: As in Fig. 26, for the HF-only configurations. 
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Figure 28: Left: ET power-law integrated sensitivity for the triangle configuration for the full 
HFLF-cryo configuration and the HF-only configuration, assuming SNR; = 1 and Toy, = lyr and 


including only cross-correlations between detectors. Right: The same for the 2L-15km-0? configura- 
tion. 


impact on the sensitivity to stochastic backgrounds at low frequencies. For instance, for the 
2L 15km with parallel arms, the HF-only configuration is worse than the HFLF-cryo by a 
factor ~ 10? at 10 Hz, and ~ 10? at 3 Hz. In contrast, the sensitivity above about 60 Hz is 
unaffected. 


5.2 Angular sensitivity 


In this section we study the angular resolution of different ET configurations in the context 
of an SGWB. We follow the formalism in [116], whom we refer the reader for details and 
derivations. The sensitivity with which we can measure a given multipole of the background 
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intensity is characterized by a quantity, Ne, such that the signal-to-noise ratio of an angular 
power spectrum is given by 
C 
2. £ 
SNR* = 2 RT. 


(5.5) 


where the C; have been defined in eq. (5.4). The Ne can be computed from 


AC =; » n Eon (5.6) 


ABCD 


where 


g 2 AB CD dim Re | ABC,emlt, ADA m(t f) 
cater p = (DY (up) sy etre Dit), y 
and 

Aansn(t, D) = [ PAY) Anel, fA), (5.8) 


is the angular multipole of the antenna pattern (A.5); we explicitly indicated its time de- 
pendence. These expressions allow one to predict the angular power spectrum of a given 
experiment and scan strategy. A key assumption made in obtaining this result is that the 
time-integrated response of the instrument can be compressed into an angular power spec- 
trum. In practice, this will not be the case, since the scan strategy cannot be designed to 
average the noise over the m-modes and the resulting estimator will be sub-optimal. This is 
a necessary assumption for any analytical estimate; however, in practice, optimal estimators 
will have to account for inhomogeneous noise in m-modes. 

Eq. (5.6) involves a two-dimensional integral over t and f, where the integrand involves 
at least one computationally expensive spherical harmonic transform of the antenna pattern 
A. This can be simplified further for experiments with a constant configuration (e.g. constant 
arm lengths, angular separations between arms, relative detector positions). As explained in 
[116], in this limit the network changes as a function of time as a rotating solid. Hence the an- 
tenna patterns at different times are related to each other through simple three-dimensional 
rotations (i.e. A(t, f, à) = A( f, Rñ), where Rz is a time-dependent rotation matrix). Recall- 
ing that a rotation only mixes the m-modes for each fixed £ in Aem, the function G; in (5.7) 
does not depend on t (assuming detector noise to be stationary). In this case one gets the 


compact expression 
E Tob AB,CD 
NS » pac (F), (5.9) 
ABCD 


where Tops is the total observing time. 

In Fig. 29 we plot the noise per multipole Nọ at a reference frequency fref = 100 Hz for 
different detector configurations. When looking at odd multipoles, we see that the 2L config- 
urations are much more sensitive than triangular configurations. This is due to the fact that 
the three nested interferometers composing a triangular detector are separated by a baseline 
whose size is one arm length, while the two L-shaped detectors of the network have a much 
larger baseline. As a consequence, the antenna pattern function of triangular configuration 
is almost completely even under parity, hence the instrument has a very bad sensitivity to 
odd multipoles, see [116]. Furthermore, as expected, aligned detectors have better sensitivity 
than misaligned detectors. We observe that, with the most favourable configurations, i.e. 2L 
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Figure 29: Noise per angular multipole at a reference frequency of 100 Hz for the four configurations 
under study. In the left and right panel we show the sensitivity to odd and even multipoles, respec- 
tively. All figures are obtained using the schNell package https://github.com/damonge/schNell. 


aligned with 15km or 20 km length, we are potentially able to detect a dipole anisotropy at 
the level of ~ 1071? (hence Ns, ~ 107?4), which is the size of the kinematic dipole expected 
from a background component on the edge of being detected by LVC at O5 sensitivity, see also 
[113]. 9^ 


5.3 Astrophysical backgrounds 


Binary mergers nowadays detected by LIGO-Virgo-K AGRA are only the tip of the iceberg. 
In fact, a large fraction of the events at larger distances are not resolved by the detectors and 
are expected to constitute the dominating contribution of the astrophysical background. For 
compact binary coalescences (CBCs), eq. (5.1) can be rewritten in term of energy density 
flux as [117] 


Qe (f) = IF), (5.10) 
where F(f) is defined as 
e d. Er. 
k=1 


and the sum is over the N individual merger contributions arriving on the detector during 
an observation time T. Rewriting the energy density, one can express the spectrum Qew(f) 
in terms of frequency domain waveforms: 


nc? 


Ogw (f) = 2G pe 


N 
GPO YS P Hi (5.12) 
k=1 


with h f) the sum of the squared Fourier domain GW amplitudes for the two polarization 
+/x: . . E 
helf) = hial) + hke). (5.13) 


?"We observe that the plateau at high-£ values is symptomatic of the fact that we have lost sensitivity to 
those multipoles, see [116]. 
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Figure 30: Background prediction for the current state-of-the-art astrophysical models, assuming 
a metallicity spread oz = 0.3. 


Figure 30 shows predictions for the background, derived from the current state-of-the-art of 
population synthesis models [43, 44, 47, 48, 50, 52, 54] with a metallicity spread oz = 0.3 
for the three types of binary: binary neutron stars (BNSs), binary black holes (BBHs), and 
neutrons star-black hole systems (NSBHs). The BNS and BBH population are the ones 
described in Section 3 and used through this paper to evaluate general metrics and several 
specific science cases. On these plots are also shown the PLS for the main configurations of 
this study, already shown in Fig. 26.7! The low-frequency part of the spectrum is predicted 
to have an evolution proportional to f?/3, typical for the inspiral phase of compact objects 
binaries. This shape is common to all CBCs, therefore a sign of a specific population on 
the background is given by deviations from the /?/? behavior. For example, in Fig. 30 the 
deviation of the total background ('AII', dark blue curve) from the f 2/3 shape between 90- 
500Hz is due to the inclusion of BBHs from the cluster, while the deviation above 1000Hz is 
coming from the BNSs. Consequently, the background will be a new observation channel for 
population studies of compact objects. 

Stellar CBC background challenges in ET can be divided into three main categories: 
the search of Population III stars, the study of BBH formation channels combined with the 
star formation history (SFH), and the residual BNSs background. All these categories are 
represented on Fig. 31 at the location of predicted background signatures, together with the 
ET PLS for the different geometries considered (all taken with their best ASD, see Section 2). 


BBHs channels + SFH: This part of the spectrum is where (depending on their masses 
and spins) individual BBHs stop emitting gravitational waves, so that the overall spectrum 
drops. Two main factors can imprint some signature in the range [100-1200] Hz: (1) the SFH, 
which is a combination of metallicity /redshift, and star formation rate/redshift relations; and 
(2) the formation and evolution channel of the binary (ie. the environment of the binary 
formation/evolution). The detection of these signatures will be a new channel of observation 


?' Here, and in Fig. 31, the PLS are computed including only cross-correlation between detectors, and setting 
SNR;; = 1 and Tops = 1 yr, as in the plots of Section 5.1. 


— 58 — 


1M Ter A 
y gy j; 
‘ BBH channels? 
10-? + SFH E f; 
qu fÉ 
107, //'BNS mergers 
| (residual) 
E 10-10 
e Pop. Ill 
1071 
RSS. ee — Triangle 10km 
N:e -——— e^ = T Triangle 15km 
de VAI EAE Á— 2L 45* 15km 
uis Ja ee ---- 2L45* 20km 
EN Hu eee CA 
eee E ZI S o 2L 0* 15km 
E ---- 2L.0° 20km 
-13 a : . . . 
10 qoo 10! 10? 105 


f [Hz] 


Figure 31: Location of the predicted signatures for the three main challenges for the CBC back- 
ground. 


for the stellar formation and evolution [118, 119]. For this specific signature the triangle 
configuration is preferable to any of the four 2L configurations that we have considered (15 
or 20 km arm-length, aligned or at 45? ), because its sensitivity to stochastic backgrounds will 
be better for frequencies above 200 Hz. 


Population III: Population III stars are the first, metal-free stars that formed in the 
Universe [120]. The peak of the merger rate density of compact remnants left by Population 
III stars is expected to be at high redshift, z ~ 8 — 16 [121-125]. All of the unresolved 
population III merger events then combine to an SGWB in the [5 — 200] Hz frequency range 
that may deviate from the usual f?/? power-law behaviour [126-130]. This is due to the 
fact that, while population I/II stars produce an SGWB in the LIGO/Virgo frequency band 
during their inspiral phase, population III stars would instead show up at the merger and 
ringdown phases. In the frequency range where population III stars could leave their imprint, 
the 2L configurations with parallel arms are the best ones. 


BNS residual mergers: The background, as it is defined, contains only non-resolved 
gravitational waves. In that sense, the residual background (background obtained after sub- 
traction of all resolved sources”) will reveal the BNSs contributions to the background. 
Figure 32 shows the residual backgrounds for each type of binary and some of the configura- 
tions. As for BNSs, they are more difficult to be resolved due to their low masses, and their 
gravitational waves will mostly remain in the background and dominate it [118, 126]. 


5.4 Impact of correlated magnetic, seismic and Newtonian noise 


Since searches for a stochastic gravitational-wave background (SGWB) [131] typically rely 
on cross-correlating data from two or more detectors, they are also susceptible to correlated 
noise. In this section we will discuss the impact of different noise sources on the ET and how 


?2We assumed an SNR threshold of 12 to define a source as resolved. The resolved sources are then 
subtracted to obtain the residual background. 
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Figure 32: Backgrounds from all types of binaries (BBHs+BNSs+NSBHs), compared to the sen- 
sitivity of various detector configurations. The blue thick line corresponds to the total background, 
while the violet ones to the residuals (i.e. the background obtained after the subtraction of resolved 
sources) for the different configurations. 


they affect the different configurations, that is triangular versus 2L configuration. In Sec- 
tion 5.4.1 we will discuss the effect of correlations in the seismic and accompanying Newtonian 
noise (NN), whereas Section 5.4.2 focuses on correlations in magnetic field fluctuations. 

In the following sections, we have used the ET-D design sensitivity in a triangular design 
with arm lengths of 10 km. However, the discussion depends minimally on this assumption 
and is a good representation for the comparison of the triangular versus 2L configurations 
regardless of the exact arm length and sensitivity. 

At the end of the section, in Table 12 the frequency regions of searches for an SGWB 
affected by correlated noise are summarized. We want to point out that here we only discuss 
the effect of fundamental noise sources, which implies that infrastructural noise sources will 
have to be reduced to a similar or lower level than the fundamental noise sources. This is, to 
some extent, discussed for magnetic noise in Section 5.4.2; for all noise sources, i.e magnetic, 
seismic and Newtonian noise, this should be investigated further. Furthermore, we stress 
that most of these correlated noise investigations focus on the effects of the search for an 
isotropic SGWB. However, as shown in a recent study on the effect of magnetic glitches due 
to lightning strikes [132], some correlated transient events might also affect other searches, 
which should be further investigated for seismic and Newtonian noise. 


5.4.1 Seismic and Newtonian Noise 


During the last decade, much research has focused on understanding the effect of seismic 
and Newtonian noise on the sensitivity of ET [133-137]. In a recent study the possibility 
of correlated seismic and Newtonian noise is investigated for the ET triangular configura- 
tion [138]. Searches for a SGWB using data from the second-generation gravitational-wave 
interferometric detectors, LIGO and Virgo, are considered to have no correlations in the seis- 
mic ambient field due to their large separation and the damping of seismic waves. However, 
in the triangular design of the ET the input and output mirrors of two interferometers are 
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proposed to be separated by distances of only several hundred of meters. As illustrated in 
Fig. 11 of [138], there are five coupling locations between two ET interferometers, of which 
two are between aligned mirrors and the remaining three between mirrors rotated by 60°. 
Based on the ET design report [139] the horizontal separation between those mirrors is about 
300—500 m for the aligned mirrors and 330—560 m for the other pairs [138]. 


To investigate correlations between seismic measurements over a scale of several hun- 
dreds of meters, ref. [138] analyzed data at the Earth's surface from geophones deployed near 
Terziet (NL) [140] and underground data from seismometers in the former Homestake mine 
(South Dakota, USA) [141]. The study concludes that at the surface at Terziet significant 
(above Gaussian noise background) seismic correlations exist between sensors with a separa- 
tion of several hundreds of meters. The 50th percentile coherence is significant up to about 
10 Hz, whereas the 9096 coherence is significant up to 50 Hz, which is the highest recorded 
frequency. For 9096 of the time at Homestake there is significant coherence up to 20 Hz 
and more than 5096 even up to 40 Hz; that is the highest frequency before the response of 
the sensors decreases drastically. Neither the coherence nor the cross-power spectral density 
(CSD) were observed to have a dependence on distance (between 200m and 800m). At both 
locations night times are observed to be quieter, with lower CSDs. At night there are more 
periods when there is significant correlated noise. These studies indicate that there is a non- 
negligible amount of coherence during a significant amount of time (night and day), up to 
frequencies of 50 Hz. This leads to the conclusion that the effect of these correlated noise 
sources should be understood, particularly for searches for an SGWB. 


Note that in [138] it is emphasized that some correlations are considered to be of an- 
thropogenic origin. Furthermore, some of the power spectral densities (PSDs) observed at 
the candidate sites are lower than the CSD observed in the Homestake mine, and therefore 
site-specific studies should be envisioned in the future. However, the results discussed in [138] 
are expected to give a good indication of the order of magnitude of the possible levels of corre- 
lated seismic and Newtonian noise which might be expected in an underground environment. 
Homestake represents an underground environment where both ambient and anthropogenic 
noise sources are present. This may also be expected for ET, since its infrastructures could 
be expected to introduce additional anthropogenic noise, regardless of various attempts to 
reduce and shield them. 


It is possible, based on the measurements of correlations of the seismic noise, to estimate 
the correlated Newtonian Noise (NN). The NN is a disturbance produced in the gravitational- 
wave detector by local fluctuations of the gravitational field. Two types of seismic waves 
produce NN: the first are Rayleigh waves which propagate at the surface of the Earth, and 
the second are body waves present in the Earth's mantle. The NN from Rayleigh waves 
decreases significantly with depth, so their effect on ET will be limited if one assumes an 
underground facility at 300 m. As shown in Fig. 15 of [138], the NN from Rayleigh waves 
significantly impacts the power-law integrated sensitivity of ET only up to about 5 Hz.?? At 
higher frequencies, no effect is expected. 

'The median contribution from correlated body waves, however, is expected to be three 
to five times higher than the ET sensitivity, in the range of 2 Hz to 10 Hz. While it is 
important to investigate the impact of noise on the sensitivity curve, correlated noise can 
more drastically affect some searches. The search for a SGWB typically relies on the cross- 


?5'The power-law integrated sensitivity is defined as in [142], see also App. A. We use an SNR of 1 after one 
year of data at a 10096 duty cycle. 
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Figure 33: The projected impact from correlated NN from body-waves Cyn,y, as calculated in 
Section III of [138], is shown in blue. The blue line represents the median value, and the associated 
surface is delimited by the 1096 and 9096 percentiles. Also curves with more optimistic assumptions 
(see the text) are shown. Based on Fig. 16 of [138]. 


correlation of data from two detectors to decrease the Gaussian detector noise and improve 
the search sensitivity. 

Fig. 33 presents the expected impact on the search for a SGWB due to NN produced 
by body waves.?^ It shows that the budget calculated in [138] affects the SGWB search up 
to at least 40 Hz (blue band), which is the highest frequency in the analysis. Near 3 Hz, this 
gives an effect which is larger than the 1 yr ET sensitivity curve for the SGWB (light brown 
line in Fig. 33) by a factor ~ 8 x 10° (90% percentile), or ~ 6 x 10? (50% percentile). 


According to the noise estimate shown in blue in Fig. 33, with the triangular configu- 
ration the sensitivity of ET to an SGWB, at least below O(50 — 100) Hz, would be of the 
same order of magnitude as LIGO’s A+ and Virgo’s AdV+ design. Even assuming that an 
optimistic factor of 10 NN cancellation [133] could be applied to each interferometer, i.e. a 
factor of 100 for the SGWB budget, the search for an SGWB would still be impacted up to ~ 
30 Hz. When considering the night-time data with fewer anthropogenic activities, the impact 
decreases but stays at the same order of magnitude, i.e. a factor ~ 10? — 10° at 3 Hz [138]. 

In Fig. 33 we also provide the result from a more optimistic scenario (pink curve). 
'The construction of this scenario is broken down as follows. We start from the yellow curve, 
which displays the affect of NN from body waves assuming a seismic spectrum identical to the 


24To produce this figure, for the broadband (ORE anie ORBE g yo) sensitivity to an SGWB we assumed one 
year of observation time (10096 duty cycle). The one-year PLS curve of the A+ design for the LIGO Hanford, 
LIGO Livingston detectors and AdV+ design for the Virgo detector is represented by the dot-dashed curve. 
LIGO's A+ and Virgo's AdV4- design are the interferometer designs which are aimed to be reached during 
O5 [23]. This curve was obtained using the open data provided by the LVK collaborations [143] and was 
first presented in [144]. Note that in this figure we present the 1o PLS, whereas in [144] the 20 PLS-curve is 
shown. 
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Peterson low noise model [145] and 10096 coherence. We then assume a NN cancellation by a 
factor of 3 (i.e. a factor of 9 on the stochastic budget), which is considered as a realistic target 
(green curve). The dark orange curve is generated from the assumption that the coherence 
equals to the median coherence observed in Homestake, as reported in [138]. Finally the 
pink curve represents the Peterson low noise model multiplied by the coherence observed 
at Homestake, as well as a factor of 3 NN cancellations for each detector. This pink curve 
represents the most optimistic scenario that one can achieve. In this case, SGWB searches 
will still be affected by correlated NN up to ~10Hz. 

The effect of correlated noise presented above, as well as possible additional effects from 
infrastructural noise, would only affect the search for an SGWB in the case of the triangular 
design leading to closely located interferometers. The problem of co-located correlations 
(of order a few hundred meters) of the seismic field should represent no problem for very 
distant L-shaped interferometers. However, for a triangle one can take advantage of the 
“null-channel” to estimate the power spectrum [146, 147], see the discussion in Section 7. 
In the future, site-specific studies could give more detailed information. Furthermore, it 
would be important to estimate the order of magnitude of correlated noise expected by 
infrastructural noise sources caused by the ‘typical equipment’ needed for an underground, 
cryogenic gravitational-wave interferometer. 

Finally, the effects of seismic transients such as (micro-)earthquakes should be under- 
stood. The latter would not only affect the search for an SGWB but could also be expected 
to affect searches for other GW signals such as CBCs, bursts or continuous waves if the noise 
sources are too loud. Further studies on the possible effect of correlated seismic transients 
would also be advisable. Such a study could be envisioned to investigate the possibility of 
lock-loss, similar to an earlier study for LIGO [148], or the impact on the sensitivity, similar 
to a study of the effect from earthquakes as part of the O3 environment investigations done 
at Virgo [149]. In the studies for ET one should also look into the possibility of correlations, 
as well as the ‘transient Newtonian Noise’, linked to the transient seismic fields and how 
these might introduce additional effects on the different signal searches. 


5.4.2 Magnetic noise 


One noise source commonly investigated with respect to the search for an SGWB is the 
Schumann resonances [150, 151]. These are standing electromagnetic waves in the cavity 
formed by the Earth’s surface and the ionosphere, and are driven by lightning strikes all 
across the globe. The fundamental mode has an amplitude between ~ 0.5 pT and ~ 1 pT 
depending on the location and time of the day and year. The resonances are several Hz wide 
and have frequencies ~7.8Hz, ~14Hz, ~21Hz, ... 

To explain how magnetic fields couple to the interferometer, it is instructive to under- 
stand how the coupling is measured. On a regular basis, strong magnetic fields are generated 
in the experimental buildings of the current generation of interferometric detectors. By in- 
jecting fields many times stronger than the ambient magnetic field one tries to measure their 
effect on the strain ASD. We define Y and X to be the ASD of the strain and witness channel 
(magnetometer) respectively. We use the subscript ’inj’ and ’bck’ refer to times during the 
injection, respectively a quiet background time just before/after the injection. Then we can 
measure the magnetic coupling function K as follows [152], 


Yo) — Yea (7) 
Kmeasured (f) = Ja (f) — XA) 3 (5.14) 
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'The coupling function describes how strong the strain data will be affected given ambient 
magnetic fields. 


In recent work, the effect of the Schumann resonances and other forms of correlated 
magnetic noise on ET was investigated [153]. At low frequencies (« 30 Hz) it was shown 
that the magnetic coupling function has to be improved by two to four orders of magnitude 
compared to the second-generation detectors Advanced LIGO and Advanced Virgo in order to 
prevent significant coupling. Note that compared to the Advanced Virgo coupling function, a 
factor of 5 improvement could be expected due to the larger mirror mass of the ET [154, 155]. 
However, this assumes that the same actuator magnets as with Advanced Virgo will be used, 
which will make the control of the test mass more difficult. Furthermore, at low frequencies, 
the Advanced LIGO magnetic coupling is lower compared to the Advanced Virgo magnetic 
coupling. Therefore the electrostatic actuator design used by Advanced LIGO should be 
considered for ET. If ET will rather have a magnetic coupling similar to that of Advanced 
Virgo, the impact of correlated magnetic noise is predicted to be larger and will impact 
searches at a wider frequency band, up to f < 50 Hz. One must also stress the danger that 
induced eddy currents close-by to sensitive detector parts pose, so these should be minimized 
as much as possible. 


The effect of magnetic noise is shown in Fig. 34, based on Fig. 6 of [153]. It presents 


the ratio of KE D and the magnetic coupling measured at LIGO and Virgo. The same 


reatio is also whon for os cryo: Here KE Dy and ke cryo BRE the magnetic coupling 


for each ET interferometer such that no effect of correlated magnetic noise is present in the 
search for an isotropic SGWB using one year of data with 10096 duty cycle at the HF-only, 
respectively HFLF-cryo design sensitivity. For more details on the computation, we refer the 
reader to [153]. The needed improvement of the magnetic coupling such that magnetic noise 
is not larger than 1/10 of the ASD ?^, is roughly a factor 10 less stringent than the SGWB 
constraint. For more information see Fig. 6 and the accompanying discussion of [153]. Note 
that in [153] was not taken into account a factor of 5 of reduction obtained by increasing 
the mirror mass. In the top left panel of Fig. 34 we applied this additional factor-of-5 
reduction for Virgo's magnetic coupling below 100 Hz. At higher frequencies, the coupling 
function is dominated by different mechanisms. We note that in the recent LIGO-Virgo third 
observing run (O3), for which we use the magnetic coupling function, an additional coupling 
was observed [152] compared to the earlier work describing the inverse proportionality with 
mirror mass [155]. 'Therefore it is not guaranteed that this reduction could be achieved if the 
other coupling mechanisms are not properly understood and mitigated. 


Up to ~ 20 Hz, the observed amplitude of the magnetic CSD is similar, regardless of the 
separation between the detectors [132]. At higher frequencies, the magnetic CSD decreases 
due to stronger attenuation of electromagnetic fields travelling long distances. Therefore the 
effect of correlations in magnetic field fluctuations up to ~ 30 Hz on ET is (mainly) indepen- 
dent of the detector configuration. Both the triangular and 2L designs will be impacted at 
the same order of magnitude. However, it is important to understand the distance between 
the detectors and large thunderstorm regions. Regions with large thunderstorm activity are 
the Americas, sub-Saharan Africa and southeast Asia [156], but more research is needed 
to investigate the ‘local’ activity in Europe and the possible impact on gravitational-wave 
interferometric detectors. 


?»'l'his is a typical requirement for noise sources which are considered not to be a "fundamental' noise source, 
i.e. part of the calculation of the optimal sensitivity curve. 
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Figure 34: Top panels: needed improvement factor as a function of frequency for the upper limits 
on the ET magnetic coupling function, such that the isotropic SGWB search is unaffected. LHO and 
LLO refer respectively to the LIGO Hanford and LIGO Livingston observatories. The low-frequency 
(left panel) magnetic coupling poses a greater challenge for the operation of ET compared to the 
high-frequency (right panel) magnetic coupling. In both panels, the dash-dotted blue line indicates 
the line where no improvement is necessary. This figure is based on the two right panels of Fig. 6 of 
[153], however, we have used an additional factor of 5 in the reduction due to the larger mirror mass 
when going from Virgo to ET, in line with earlier work [154, 155]. This only applies to the left panel, 
since at high frequencies the coupling is dominated by other mechanisms. We report the upper limits 
for both the HF-only and HFLF-cryo sensitivity for SGWB searches (in [153] they used ET-B and 
ET-D). Note that, for f > 100 Hz, HF-only and HFLF-cryo have the same sensitivity to a SGWB, 
so we only present the HFLF results in the top left panel. 

Bottom panel: the upper limits on the maximal allowed magnetic coupling to prevent impact on 
the ASD and/or SGWB searches in the pessimistic scenario where magnetic infrastructural noise, as 
measured in the central building at Virgo, will be fully correlated between interferometers. The upper 
limits are provided for both HF-only and HFLF-cryo. The upper limits are compared with the mean 
observed magnetic coupling at the LIGO and Virgo detectors. Based on Fig. 7 of [153]. 
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To summarize the effect of fundamental magnetic noise on HFLF-cryo ET, we can 
state that at low frequencies (<30 Hz) an effect of up to O(10?) will be present on the ASD 
and up to O(10*) on the sensitivity for the SGWB. This is (mainly) independent of the 
triangular versus 2L configuration, due to the low attenuation of electromagnetic fields at 
these frequencies. Despite the dramatic effects of correlated magnetic noise, one can consider 
several approaches to reduce the magnetic coupling, as will be further discussed at the end of 
this section. 

At higher frequencies, minimal effects are expected [153], however, these are more de- 
pendent on the configuration. The co-located triangular design will be more susceptible 
compared to two spatially separated Ls. Even more important is the location of the detector 
with respect to active thunderstorm regions. 

The effect from individual lightning strikes is also predicted to impact the ET sensitivity 
[132], in case of similar magnetic coupling as at LIGO and Virgo. A glitch rate of O(10°) for 
correlated magnetic transients per week could be expected, with the potential to affect ET’s 
detector sensitivity <50 Hz [132]. It might also impact background estimation for CBC and 
burst searches. The 2L configuration is less prone to effects coming from coherent lightning 
strikes compared to the triangular design. 

Depending on which coupling function is used, the magnetic contamination at low fre- 
quencies (<50 Hz) is roughly at the same level (or worse) than the NN from body waves as 
projected in Fig. 33. However, there is a crucial difference between the two scenarios: the 
effect of the NN can only be mitigated by noise cancellation methods. For the magnetic noise, 
noise cancellation is also an option [157], however one can also directly attempt to improve 
the magnetic coupling function. An earlier study investigated the possibility of additional 
magnetic shielding and considered several configurations of Helmholtz coils or a spherical 
shell made out of chain-link metal [158]. This study showed that a factor-of-10 reduction in 
the magnetic coupling function should be feasible by several different shielding approaches. 
These results are promising, given the ‘simple’ methods proposed. One can consider doing 
better by using mu metal or superconducting coils, however, this will come at a significantly 
larger cost. Furthermore one can seek to reduce the number of magnets (or metallic parts) 
attached to the lower stages of your suspensions and test masses [159]. Even better would be 
to move magnets and metallic structures higher up in the suspension chain, leading to the 
dampening of the magnetic fields by the respective pendulum stages. Finally, the induction 
of currents in cables of e.g. the actuator system [159] might be mitigated by using optical 
fibres where possible, definitely in sensitive locations. More research is needed to obtain 
reductions in magnetic coupling by different methods. Nevertheless, one could envision sub- 
stantially decreasing the effect of magnetic fields by using a combination of the proposed 
methods, which might however come at a non-negligible investment cost. We would also like 
to point out that the study [153] showed that ET’s sensitivity curve might be affected up 
to ~ 15Hz-20Hz (top left panel of Fig. 6 of [153]), and some additional shielding and/or 
redesign of the suspensions is needed to prevent any limitations on ETs sensitivity curve. 

The previous discussion assumes all effects from infrastructural noise are overcome. 
This infrastructural magnetic noise is typically about two orders of magnitude louder than 
the fundamental magnetic noise; for example, note the effects in second-generation detectors 
such as Virgo’s central building [153]. In the case of a very pessimistic assumption, where the 
magnetic noise in ET is the same as in the Virgo central building and it is fully correlated 
between two ET interferometers, an additional one to two orders of magnitude improvement 
of the magnetic coupling function will be required. This would only apply to the co-located 
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SGWB searches 
Noise type Triangle 2L 
Seismic ambient Yes, X 4 Hz 6 No 
NN ambient - Rayleigh waves Yes, € 5 Hz 2” No 
NN ambient - Body waves Yes, S 40 Hz No 
Magnetic ambient Yes, < 30Hz, possibly > 100Hz 7° | Yes, X 30Hz 


Table 12: Frequency regions over which searches for an isotropic SGWB would be affected, con- 
sidering different types of correlated noise and assuming no improvements in the magnetic coupling 
function with respect to Advanced LIGO and Advanced Virgo. Furthermore, in this Table, we only 
mention fundamental noise sources and we neglect the potential effects of local (infrastructural) noise. 
Whereas more research is needed to address these noise sources in the case of seismic and Newtonian 
noise, for magnetic noise an initial study [153] showed that the correlated noise might affect the entire 
frequency band for which measurements of the magnetic coupling exist, i.e. between ~ 10 Hz and ~ 
675 Hz. This would only affect the search for an SGWB in the case of the triangular configuration. 


interferometers, such as the ET triangular configuration. Furthermore, this would imply 
that the entire frequency range over which measurements of the magnetic coupling exist 
(~ 10 — 675) Hz, would be affected if no improvement in the magnetic coupling is achieved 
with respect to Advanced LIGO and Advanced Virgo. In that scenario, the magnetic noise 
has the potential to limit the ET sensitivity, as shown in the bottom panel of Fig. 34 [153]. 


This would imply that even greater investments might be needed to sufficiently reduce 
the magnetic coupling in the case of the triangular configuration, as compared to the 2L 
configuration. Further research is needed to understand to which level these infrastructural 
noise sources could be produced and correlated between the ET interferometers. 


?6 Assuming a horizontal-to-vertical and vertical-to-horizontal coupling < 10^ !? above 4Hz. Tilt measure- 
ments and coupling was not taken into account. For details see [138]. 

27 Assuming an underground facility located at a depth of 300m. For details see [138] 

?8 This assumes a magnetic coupling function similar to LIGO Hanford. For a coupling similar to Virgo 
magnetic contamination is possible X 50Hz and = 100Hz. Furthermore, additional research is needed with 
respect to lightning activity in Europe and the effect might be site-dependent. For more details see [153] and 
[132]. 
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6 Impacts of detector designs on specific science cases 


We now discuss how different geometries, or different ASDs, affect specific aspects of the 
Science Case, examining a broad set of particularly significant questions in fundamental 
physics, cosmology, and astrophysics. 


6.1 Physics near the BH horizon 


A cornerstone of fundamental physics is testing the nature of compact objects and the grav- 
itational interaction in the relativistic, highly-dynamical regime at the scale of the BH hori- 
zon. In vacuum GR, the Kerr geometry [160] is the unique physically acceptable equilibrium, 
asymptotically flat BH solution [161—163]. Therefore, deviations from the Kerr metric require 
either modified gravity or specific matter fields within GR. 

In the following, we discuss the impact of detector designs on specific tests of the nature 
of compact objects and the physics at the BH horizon. 


6.1.1 Testing the GR predictions for space-time dynamics near the horizon 


'The ringdown waveform originates from the perturbed remnant object, and can be approx- 
imated as a superposition of damped sinusoids characterized by the complex quasinormal 
modes (QNMs) of the remnant. The frequencies, damping times, amplitudes, and phases of 
QNMs depend on the binary progenitors and on the underlying theory of gravity. 

There are countably infinite QNMs indexed by (l, m, n), where (l, m) denote the angular 
dependence of the mode and n — 0,1,2 is the overtone index (with n — 0 labelling the 
fundamental tone). The (l, m, n) = (2,2,0) is the dominant mode in a binary BH ringdown 
signal and the excitation of the subdominant modes depends on the initial configuration of 
the progenitor binary [164—170]. 

If the remnant is a Kerr BH and the underlying theory is GR, the entire QNM spectrum 
is fully characterized by its mass My and dimensionless spin xy. The BH spectroscopy 
program [166, 171-175] aims at detecting more than one QNM, providing us with multiple 
independent null-hypothesis tests of GR. In addition, even if the remnant is described by the 
Kerr solution but the underlying theory is not GR or the progenitors are not BHs (for example 
NSs or more exotic objects), then the QNM amplitudes and phases would be different from 
the GR prediction [170]. Thus, measuring the ringdown modes in the post-merger signal 
of a binary coalescence provides a clean and robust way to test GR. and the nature of the 
remnant. 

'The fundamental QNM frequency and damping time have been measured by the LVK 
Collaboration only for a few events, providing an independent measurement of the mass 
and spin of the remnant which is in agreement with what inferred from the inspiral-merger 
phase [8]. The first GW event, GW150914, stands out among those with the highest accuracy. 

If measured with ET, GW150914 would have had a ringdown signal-to-noise ratio be- 
tween 141 and 240 depending on the detector configuration, see Table 13. It would therefore 
constitute a golden event with exquisite precision in the measurability of the QNM spectrum. 
We can quantify the measurability of a GW150914-like event with ET using a Fisher matrix 
formalism [173, 176]. We include the first three subdominant modes (3,3,0), (2, 1,0) and 
(4, 4, 0), modeling their amplitudes through the numerical fits in [166, 170], and assuming a 
start time for the ringdown at t ~ 10M after the peak of the waveform. The relative un- 
certainties on the dominant frequencies fo») and damping times 7299 are prospected to scale 
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SNRawi50914 || HFLF-cryo | HF-only 
A-10 km 141 141 
A-15 km 190 190 

2L-15 km-0? 196 196 

2L-15 km-45? 192 192 

2L-20 km-0? 240 240 

2L-20 km-45? 235 235 


Table 13: Projected ringdown SNR of GW150914 as detected by ET with different detector ge- 
ometries (left column headers) and PSDs (top row headers). As explained in the main text, we do 
not observe differences between different PSDs because the signal is dominated by a monochromatic 
component at a frequency insensitive to the LF part of the PSD. 


with the ringdown SNR as 


A^ 100 A 100 
Afmo Qoo ( 100. mU sais ge (6.1) 

Í 220 SNR T220 SNR 
which propagate similar relative errors on the mass and spin of the remnant (assuming the 
latter to be a Kerr BH) thus giving an estimate of the performance of consistency tests in- 


formed by the inspiral part of the signal [8]. On the other hand, the subdominant frequencies 
scale as 


A f330 100 A fz10 100 A fasso 100 
£330 ^ E3 o fno ^i (xn) ' fato 9 (sam) Vel 


This shows that, by detecting high-SNR events, and in particular golden events with SNR > 
100, ET will be able to measure the QNM spectrum at the percent level for individual events. 
Moreover, by stacking together multiple events (see Table 14 below), it will be possible to 
test deviations from the GR predictions at the sub-percent level. 

Next, using the BBH population catalog generated with the FASTCLUSTER code [53, 54] 
as described in Section 3, we estimate the event rates for BH spectroscopy with different 
ET designs. Figure 35 shows the distributions of source-frame mass M? = (1 + z)My and 
xf for the final BH, as predicted from the simulated BBH merger events in the catalog, 
and computed from the analytical fits in [177, 178].2? The spin x f clusters around 0.7 as a 
consequence of the fact that the majority of the BBH events in the catalog have progenitor 
mass ratio close to unity (with only a small portion having mass ratio q = mı/m2 Z 5) 
and moderate progenitor spins. Furthermore, we see that the distribution of the QNM 
fundamental frequency peaks at around 100 Hz, which sets the typical scale of the (almost 
monochromatic) ringdown signal. 

In order to assess the feasibility of BH spectroscopy with different ET configurations, we 
compute the number of ringdown detections per year. Here a detection corresponds to SNR > 
12 in the ringdown-only portion of the signal. Moreover, we compute the number of ringdown 
detections per year with ringdonwn signal-to-noise ratio SNR > 50 and with SNR > 100, the 
latter representing golden events for BH spectroscopy. All detector designs are assumed in the 
HFLF-cryo configuration. We did not find any significant difference when the LF instrument 


??We also benchmarked them against surfinBH [179], which is a machine learning predictor trained on 
numerical simulations, finding excellent agreement within the common regimes of validity. 
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Figure 35: Top: Distribution of the redshifted final masses and dimensionless final spins from the 
simulated BBH events as detailed in Section 3. Bottom: Corresponding distribution of the ringdown 
frequency and damping time of the dominant (2,2,0) mode. 


is absent. This is due to the fact that the ringdown signal is almost monochromatic and, as 
shown in Fig. 35, there is a negligible portion of events for frequencies f220 < 20 Hz, where the 
contribution of the LF instrument impacts on the ET sensitivity curve. Results are reported 
in Table 14 for the particular realization of the catalog considered here, where the rates are 
subjected to the statistical Poisson counting uncertainty e (Nae) ~ V Naet- 

In conclusion, for BH spectroscopy, the baseline 10 km triangle configuration allows 
for ~ 34 high-SNR events and 3 golden events per year. While these numbers are already 
extremely interesting, both the 15 km triangle design and the 15 km 2L design allow to 
increase by a factor ~ 3 — 4 the detection rate w.r.t. the baseline, with 90+ high-SNR events 
and 9+ golden events per year and a slight preference for the 15-km triangle. The 20 km 2L 
configurations show the best performances in terms of detected events, with ~ 240 high-SNR 
events? and ~ 20 golden events per year. Table 14 also reports the event rates as measured 
by the LVKI network in O5. We see that ET will constitute a game changer for probing 
near-horizon physics. 

Next, we investigate the redshift reach of BH spectroscopy for the two representative 
cases of a 10-km triangle and a 15km-45?-2L configuration. Figure 36 (left) shows the event 
rate per year as a function of the redshift: while ET is capable of detecting ringdown events 


3° A back-of-the-envelope estimate, assuming that by stacking Naet signals the errors shown in Eq. (6.2) are 
reduced by a factor + V Na«, suggests that by stacking multiple ringdown detections with SNR > 50, the 
2L-15 km-45? configuration can reach z 0.1% accuracy for the subdominant QNMs. 
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Naet(SNR > 12) | Na«(SNR > 50) | Na«(SNR > 100) | max(SNR) 
LVKI-O5 15 0 0 38 
ET 
A-10 km 4665 34 3 262 
A-15 km 11692 117 13 317 
2L-15 km-0° 11627 110 11 281 
2L-15 km-45° 10175 97 9 330 
2L-20 km-0° 18972 255 21 327 
2L-20 km-45° 17185 228 18 384 


Table 14: Number of ringdown events per year above the detection threshold (SNR > 12), with 
high signal-to-noise ratio (SNR > 50), and golden events (SNR > 100) for different proposed detector 
designs (all taken in the HFLF-cryo configuration) and compared to the most optmistic prospect for 
LVKI in O5. The last column displays the SNR of the loudest event in the catalogue. Here SNR 
refers to the ringdown part of the signal only. 
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Figure 36: Ringdown event rates as a function of redshift (left) and mass ratio (right) for the baseline 
10km triangle and the 15km-45°-2L configuration. Different colors indicate detectors configurations; 
continuous, dashed, and dotted contours indicate different ringdown SNR thresholds. 


up to redshift z < 14, high-SNR are limited to z < 4 and golden events are limited to 
z S 2. Figure 36 (right) displays the complementary information of the event rate as a 
function of the mass ratio: we see that the loud ringdown events are limited to small mass 
ratios q SS 3 while high mass-ratio events are detected with moderate or small SNR. Since 
different mass ratios excite different modes in the ringdown spectrum, high mass-ratio events 
offer valuable complementary information and it is, therefore, desirable to develop efficient 
stacking techniques [180-182] to take advantage from them. 

Finally, in Table 15 we also computed the event rates when ET operates in synergy 
with a single 40 km CE detector and with a network of one 40 km CE and one 20 km CE. 
The results show that joint detections by ET in its baseline configuration and CE produce 
ringdown rates comparable with ET alone in its 2L-20 km configurations, and that a 15 km 
or 20 km ET in synergy with CE will constitute a major improvement for BH spectroscopy as 
it will result in between 30 to 50 golden events per year. We also observe that the differences 


aT 


ET (ICE) || Na«(SNR > 12) | Na«(SNR > 50) | Na«(SNR > 100) | max(SNR) 
A-10 km 17268 188 18 298 
A-15 km 23634 311 29 350 

2L-15 km-0° 23342 311 29 317 

2L-15 km-45° 22262 290 27 362 

2L-20 km-0° 29307 504 41 359 

2L-20 km-45° 28126 439 38 412 
ET (+2CE) 
A-10 km 20990 268 25 308 
A-15 km 27065 383 38 359 
2L-15 km-0? 26691 384 37 327 
2L-15 km-45? 25749 357 37 370 
2L-20 km-0° 32266 597 49 368 
2L-20 km-45° 31197 531 45 420 


Table 15: Same as Table 14 but with a single 40 km CE detector (top) or with one 40 km CE 
detector and one 20 km CE detector (bottom) added to the network. 


among the results, induced by different choices of the ET geometry, remain quite significant 
even when ET is in a network with 1CE or with 2CE. 


6.1.2 Searching for echoes and near-horizon structures 


Exotic compact objects (ECOs) are horizonless objects predicted in certain quantum-gravity 
extensions of GR and in the presence of exotic matter fields [183-185]. Some broad motiva- 
tions for ECOs are (see [184] for a review): i) resolving the singularities inevitably present 
inside BHs within GR; ii) evading deep conceptual conundrums associated with the presence 
of a horizon (most notably, the information loss paradox); or iii) simply providing effective 
models for new species of compact objects that might co-exist in the universe along with BHs 
and NSs. 

Several models of ECOs have been conceived, including BH “fuzzball” microstates in 
string theories [186, 187] and boson stars as self-gravitating objects made of massive bosonic 
fields minimally coupled to GR. [188, 189]. One can devise a model-independent frame- 
work [190] in which the ECO properties are parametrized in terms of their effective radius 
ro = r4 (1 €) (where r+ is the would-be location of the Kerr BH horizon and e is a dimen- 
sionless closeness parameter), and the reflectivity R at the effective radius with a generic 
phase ¢(R). The BH case is described by a null reflectivity at the horizon for any frequency, 
whereas an ECO reflects part of the incoming radiation, i.e. R Z 0 (the reflectivity is gener- 
ically complex and frequency dependent). Because of this reflectivity, ECOs emit a different 
GW signal relative to the BH case. One of their smoking guns is the emission of a modu- 
lated train of GW echoes in the late-time ringdown stage of a compact binary coalescence, 
associated to signals reflected off the object interior [191, 192]. When an ECO remnant with 
€ « 1 forms, a ringdown analogous to the BH one is emitted at early times, followed by 
a new signal in the form of late-time echoes. Several searches for GW echoes in LVK data 
have been performed, claiming no evidence for their observation [8, 193-196] (although some 
results are controversial, see [185]). 
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Figure 37: Fractional percentage errors on the ECO reflectivity for a GW150914-like event placed 
at dr = 1 Gpc with final mass Ms = 70Mo and final spin xy = 0.68. The detector is taken in the 
cryogenic HFLF configuration and errors are averaged over the sky position and the inclination angle. 
There are no significant deviations in the figure for the HF-only configuration. The Fisher-matrix 
parameters are (My, Xf, R, (R), e), where R is generically a complex number, and the amplitude 
and phases {Ajmn, ¢imn} of the excited modes. 


Here, we assess the detectability of GW echoes with different ET configurations. We 
focus on a GW150914-like system (final mass My = 70Mo and final spin xy = 0.68) placed 
at a fiducial distance dr, = 1 Gpc. We use the frequency-domain echo template developed 
in Ref. [197] for a representative choice e = 107? where we vary the reflectivity in the range 
R € [0.01, 0.99] and set ¢(R) = 0. Figure 37 shows the fractional percentage errors (obtained 
with a Fisher matrix analysis) on the reflectivity of an ECO for different ET configurations. 
'The fractional error is a proxy for the detectability of ECOs, since the BH case corresponds 
to R = 0 and therefore any putative measurement of R should exclude R = 0 with the 
highest confidence level. We report the values of c /'R. after averaging over the sky position 
and the inclination angle. We take the detectors in the cryogenic HFLF configuration but, 
as in the ringdown analysis reported in Section 6.1.1, we do not find significant deviations 
when the LF instrument is not operating. 

The main result is that the detector configuration affects the accuracy on the reflectivity 
of compact objects by a factor of ~ 2 between the 10 km designs and the 20 km designs, with 
the 15 km designs representing the best compromise. 

This is further quantified in Table 16 for three values of the reflectivity (small, medium 
and large, respectively). We can ascribe these differences to the larger SNR of longer arm 
length configurations. The distribution of the SNR follows the same statistics of the ringdown 
SNR as reported in Table 14 and Fig. 36 and therefore the same considerations apply about 
the advantage of a 15-km design relative to the baseline 10-km design. 


6.1.3 Constraining tidal effects and multipolar structure 


The multipolar structure of a Kerr BH can be elegantly written as [198], 
MPH SPP = MH (x), (6.3) 
where x := J/M? is the dimensionless spin, M = Moo, J = Sio, Me = Meo, Se = Seo, with 


Mim and Sım being the mass and current multipole-moment tensors, respectively. For a Kerr 
BH, the only nonvanishing multipole moments have m = 0 and the mass (current) multipole 
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moments vanish when £ is odd (even). These properties are a consequence of the axial and 
equatorial symmetry of the Kerr solution. 

In general, objects other than BHs violate these symmetries and the unique relation (6.3) 
between all multipole moments and the BH mass and spin. Their multipolar structure can 
be schematically written as 


Mim = MP#+6Mim , Sem = SPH 6S, (6.4) 


where ó M4, and 6S¢,, are some model-dependent corrections to the mass and current mul- 
tipole moments. 

Generically, the mass quadrupole moment Məm is the dominant multipolar contribu- 
tion, entering the inspiral GW signal at second PN order [199]. When m Æ 0, this term 
can induce binary precession even in the absence of progenitor spins [200]. Deviations from 
standard BH predictions can be parameterized by defining Moo = —&M?x?, where k = 1 
corresponds to the Kerr case, while x(x) in all other cases. At the 2PN order, contributions 
from the quadrupole moments of the binary components enter the waveform through sym- 
metric and antisymmetric combinations given by Ks = (k1 + &2)/2 and ka = (k1 — &2)/2, 
respectively. The latter can be recast in terms of deviations from the Kerr baseline as 
Ks = 1 + ôks, and kg = ÔKa. Figure 38 shows the absolute errors on the parameter dK, as a 
function of the binary total mass, for representative systems with mass ratio q = 2 and spins 
(xi, X2) = (0.9,0.7) aligned to the orbital angular momentum (and therefore nonprecessing). 
We use a TaylorF2 waveform model, truncating the inspiral phase at the ISCO of the Kerr 
remnant, also including corrections induced by the self-force and the spin of the less massive 
object [201]. 

We find that the differences on the uncertainties obtained assuming the triangle and 
the L-shaped configurations are typically of a factor two in favor of the triangle, across the 
range of masses we considered. 

Another smoking-gun deviation from the standard Kerr BH in GR is given by the tidal 
deformability, considered in Section 3.2 for BNSs. A remarkable result in GR is that the 
tidal deformability of BHs is precisely zero [202-204], at variance with any other object in 
GR or with BHs in modified gravity [205-210]. 

Figure 39 shows the absolute error on the binary tidal parameter A as a function of 
the source frame total mass, for non-spinning, equal-mass, optimally oriented binaries, at 
luminosity distance dr, = 100 Mpc. We assume the TaylorF2 waveform,*! cutting the inspiral 


3 "We checked that, at the level of the comparison between different configurations, assuming the IMR- 
PhenomD.NRTidalv2 waveform approximant gives analogous results. However, in the range of masses of its 


or/R [76 
R =0.01 | R=0.5 | R = 0.99 
A-10 km 422 7 0.06 
A-15 km 308 5 0.04 
2L-15 km | 326 5 0.04 
2L-20 km | 265 4 0.03 


Table 16: Fractional percentage errors on the measurability of the ECO reflectivity R for small 
(R = 0.01), medium (R = 0.5) and large (R = 0.99) reflectivity for the same system and detector 
configuration of Fig. 37. 
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Figure 38: 1-c uncertainty on the parameter controlling deviations on the symmetric combination 
of the spin-induced ECO quadrupole moment, Ks = 1+ 6«;, with ks = 1 corresponding to binaries 
with Kerr BHs. We assume the Kerr case (6k; = 0) and optimally-oriented sources with mass ratio 
q = 2, non-precessing spins x1 = 0.9, X2 = 0.7 at a luminosity distance of dy, = 100 Mpc. The optimal 
orientation depends on the detector location and configuration. The Fisher-matrix parameters are 
(Me, m dL, 0, Q, t, P, te, Be, X1,21 X2, OKis-} 
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Figure 39: Absolute (1-c) uncertainty on the tidal deformability A as a function of total mass M, 
in the source frame for the various configurations. We assume negligible deformability of the source, 
nonspinning equal-mass binaries, and an optimally oriented source at a distance dr, — 100 Mpc. The 
optimal orientation depends on the detector location and configuration. The Fisher-matrix param- 
eters are (.Mc, N, dr, 0, Q, t, Y, te, Be, X1,2, X2,z, A,A}. These results complement what presented in 
Section 3.2 for BNSs to the case of ECOs or subsolar BHs (i.e. PBHs) potentially observable with 
ET, see more details in Section 6.3.3. 


phase at the ISCO frequency of a remnant Schwarzschild BH with mass ~ Miot. Extending 
the Fisher analysis to higher frequencies would significantly improve measurement errors but 
would require specific ECO models. 

We conclude that different ET configurations do not impact significantly on the accuracy 


validity, the latter provides a smaller uncertainty on A due to the fact that the waveform (calibrated on BNS 
coalescences) is valid up to higher frequencies, as well as to a dependence of the amplitude on the deformability 
(which is only included in the phase of TaylorF2). 
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in the measurement of the tidal deformability for Mt; S O(Mo), while differences are slightly 
larger, within typically a factor of order 2, for heavier binaries (with the 2L configurations 
performing again better than the triangle ones). 


6.2 Nuclear physics 


One of the longstanding open challenges in nuclear physics is to determine the properties 
of matter at supranuclear densities. Given the extremely high central densities in neutron 
stars and the possibility to form merger remnants that not only probe the highest densities 
in our Universe before black hole formation but also matter at nonzero temperature, the 
Einstein Telescope will provide important nuclear-physics insights, see e.g., Refs. [15, 211— 
219]. In particular, it will be possible to probe the cold EoS from the inspiral phase with 
unprecedented accuracy and furthermore, in contrast to existing gravitational-wave detectors, 
there is a high chance of detecting gravitational-wave emission from the merger and its 
remnant which not only probes the cold EoS but would also allow us to place constraints 
at the EoS at temperatures up to ~ 50MeV; cf. e.g. [220] and references therein. For these 
reasons, it is essential to understand to which extent the different detector configurations 
impact possible nuclear-physics constraints. 


6.2.1 Radius estimation from Fisher-matrix computation 


We start by assessing the accuracy of constraining the EoS by providing estimates for the 
uncertainty in the inferred radius based on the Fisher matrix approach. The results used here 
differ from those in Section 3.2 due to the assumption of a common EoS for the NSs when 
producing the events catalog. In particular, we adopt the DD-LZ1 model? [221-223] and 
compute the adimensional tidal deformability parameters of the objects from their source- 
frame masses m, using the A(m) relation predicted by this EoS. The choice of DD-LZ1 is 
dictated by its high allowed maximum mass (of 2.56 Ma); this makes it consistent with the 
BNS population model used throughout this work which, as mentioned in Section 3, assumes 
a source-frame mass distribution of the component objects uniform between 1.1 Mọ and 
2.5 Mo. Moreover, the behavior of DD-LZ1 complies with all the present constraints from 
low energy nuclear physics [224]. As in Section 3.2, the results are produced using the GWFAST 
package [17, 32] and adopting the IMRPhenomD_NRTidalv2 waveform model. A detailed 
study of the effect of the injection EoS and waveform model is ongoing. 

We use two ways to transform the error of the tidal deformability A [defined in eq. (3.1)] 
into the error on the measurement of the NS radius R. The first approach assumes that 
A œx Rê, which leads to 

AR 1AA 

RTK 
and arises from relations derived in Ref. [225]. This allows us directly to compute the indi- 
vidual radius uncertainty AR/R for all detections. We summarize these results in Fig. 40. 
Comparing different configurations (left panel), we find that the main limiting factor is the 
arm-length of the gravitational-wave detectors, so the 10 km triangle performs less well than 
a 15 km 2L. Consistently with a number of other results in Section 3, the right panel of 
Fig. 40 shows that the 2L with 15 km arms at 45° in the HF-only configuration performs at 
a level comparable to the 10 km triangle in the full HFLF cryo configuration. 


(6.5) 


32We point out that the EoSs employed here do not contain phase transitions, however, we expect that the 
presence of a strong phase transition will likely influence the accuracy to which we can measure the neutron 
star radius. 
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Figure 40: Distribution of the relative errors on the NS radius attainable from the adimensional 
tidal deformability combination A, as measured by different detector configurations and sensitivity 
curves. In particular, the left panel shows the results for the six geometries considered, all with their 
best sensitivity, the central panel for the 10 km triangle with the two different ASDs, and the right 
panel for the 2L with 15 km arms at 45° and the two ASDs considered. 


An important consequence of the large number of BNS detections with ET is that we 
can noticeably improve the EoS constraints by stacking information from multiple detections 
at different masses, which enables constraining the EoS at different densities. To quantify 
this, we perform a simple analysis in which we assume a common radius of 12km radius 
for all neutron stars. While this assumption is a priori unrealistic, it nevertheless captures 
the feature that different neutron star measurements will probe the same underlying hyper- 
parameters characterizing the EoS. To obtain a combined ansatz, we compute the average 


(AA/A) and use 


1 AA 
ARa = —=— | = | 12km. 6.6 
i me (%) (6.6) 


We present results for Rau in Table 17 for all ET configurations, from which we see that 
loosing the LF instrument, the constraints gets worse by about a factor of 2, irrespectively of 
the geometry. 

In Table 18, complementing the results in Section 3.3, we also report the results for some 
ET geometries, all with their full HFLF cryogenic ASD, working in a network with 1 or 2 CE 
detectors. To improve the reliability of the Fisher matrix approach, we restrict the analysis 
to events with a relative error on the adimensional tidal deformability of AA/A < 50%. 
'This threshold is chosen to have a large number of events in each configuration and with 
each ASD, and also reflects the fact that the loudest events — with smaller errors — will 
be the ones providing most of the information. We emphasize that this error represents 
only the statistical uncertainty; however, systematic uncertainties, e.g., due to modeling 
errors in the gravitational waveform will also be important [226]. Moreover, we stress that 
the results shown in Fig. 40 and in Tabs. 17 and 18 depend on the choice of the EoS. 
From the dependence of the tidal deformability on the star's compactness, it is evident that, 
with increasing mass, the tidal deformability decreases. Thus EoS models predicting large 
maximum masses in general also predict very low values of A close to the maximum mass, 
which is difficult to constrain. Indeed, DD-LZ1 predicts tidal deformability as small as A ~ 5 
for m ~ 2.56 Mo. Assuming a different EoS [which might require a lowering of the maximum 
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ASD HFLF cryo HF only 
Geometry N d A<0.5 ARa [m] | N, Ed ASO. A Ray [m] 
A 10 km 4878 10.0 1412 19.2 
A 15 km 15285 5.7 4047 11.3 
2L 15 km 45° 12013 6.4 3194 12.4 
2L 20 km 45° 25489 4.4 7387 8.2 
2L 15 km 0° 11884 6.5 3357 12.1 
2L 20 km 0° 23988 4.5 7081 8.3 
1L 20 km 5464 9.4 1493 19.1 


Table 17: Number of detections, Naet with AA/ A < 50%, and statistical error on the NS radius 
obtained combining all of them according to (6.6), ARan (in meters), for various geometries and 
ASDs. 


ASD HFLF cryo + 1CE HFLF cryo + 2CE 
Geometry ee ASO5 ARa [m] NC A<05 AR [m] 
A 10 km 24129 4.6 35694 3.8 
2L 15 km 45? 35172 3.8 47791 3.2 
2L 15 km 0° 35217 3.8 48581 3.2 


Table 18: Same as in Table 17 with some selected geometries, all with the full HFLF cryogenic 
ASD, in combination with 1 CE detector with a length of 40 km and 2 CE detectors, one with a length 
of 40 km and the other of 20 km. 


mass in the mass distribution to accommodate the maximum mass allowed by the EoS] could 
result in tighter or broader constraints. This will be discussed in further details in section 
6.2.2. Furthermore, the uniform distribution adopted in Section 3.2 predicts smaller values 
for the error on the radius, but using an EoS enables including the physical effect of the tidal 
deformability parameter decreasing with increasing mass. The results presented here should 
be taken only as indicative, however, they still provide a relevant benchmark for comparing 
different detector configurations. 

Our second approach is based on first generating a prior distribution of EoS by Monte- 
Carlo sampling of a large parameter set of 10 independent, uniformly distributed empirical 
parameters. These parameters characterize the density dependence of the energy in the 
symmetric matter (i.e. equal number of protons and neutrons) and of the symmetry energy 
(i.e. the variation of binding energy as a function of the neutron-to-proton ratio). They are 
given by the successive derivatives with respect to the density of the symmetric matter and 
symmetry energy functionals, calculated at the equilibrium density of symmetric matter. 
'Their prior distribution is consistent with the present empirical knowledge for a large set of 
nuclear data [224]. The use of the same functional to describe the core and the inhomogeneous 
crust [227] guarantees a consistent estimation of the crust-core transition inside the neutron 
star and thus consistent predictions for its radius. This approach enables incorporating 
priors from nuclear physics on the EoS and including the uncertainties at high densities. 
'The only limiting assumption is that matter is composed of charged leptons, nucleons and 
nuclei only, and in particular that no first-order phase transition occurs. There might also 
be some dependence on the different approximations applied to treat inhomogeneous matter, 
e.g. assuming vanishing temperature, which could slightly modify the crust properties (see 
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Figure 41: Posterior distribution for NS mass-radius relation obtained assuming 5970 detections 
with geometry 2L -20 km with aligned arms. For comparison, the results obtained from the nuclear 
prior as well the underlying DD-LZ1 EoS model used in the Fisher matrix calculations are displayed. 


A 
Naet Rian a [km] Ro0Mo tant [km] 


Prior 12.9831020 13.1561054 
54 13.1631021 13.3587 0 549 
592 13.1461012 13.3550083 

5970 13.1077048 15:333 0:008 


Table 19: Number of detections, Na4; along with 1o uncertainty on radii for 1.4 and 2.0 solar mass 
NSs obtained for geometry 2L-20km-0° using nuclear prior. 


e.g. [228]). However, this should not affect the comparison of different configurations. The 
assumed nuclear prior complies with the chiral EFT energy per particle band for symmetric 
and pure neutron matter as given by [229] for baryon number densities 0.02 € ng < 0.18fm ?, 
see [230] for details of the current implementation. 

In Figure 41 we show the mass-radius distribution for the nuclear prior described above 
along with the posteriors at 68% and 95% confidence intervals assuming 5970 simulated 
“observations” obtained from a Fisher matrix calculation. We also indicated the DD-LZ1 
EoS used as input EoS in the calculation. The full compatibility of the DD-LZ1 model 
with nuclear physics constraints is shown by the fact that the injected mass-radius relation 
sits in the middle of the nuclear informed prior. This particular simulation was performed 
with geometry 2L -20 km with aligned arms (cryogenic), but we have checked that different 
configurations do not make any significant difference in the outcome, provided that the 
number of detections exceeds Nae ~ 6000. 

To understand the impact of different number of detections using the same geometry 
as above, we summarize the lo uncertainty on radii for 1.4 and 2.0 solar mass NSs with 54, 
592 and 5970 detections in Table 19. We can observe that the uncertainty on the radii only 
roughly follows the Poissonian law as described in eq. (6.6). The reason is probably that 
the radius is not a direct observable but, rather, is obtained in a convoluted way from the 
TOV equation and the underlying hypothesis on the EoS formalism. As a general statement, 
we can see that the Poisson hypothesis tends to underestimate the radius uncertainty from 
the Fisher matrix analysis, unless the number of detections is extremely large (more than 
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Figure 42: Left: Constraints on the chirp mass Me, mass ratio q, and tidal deformability A for a 
binary neutron star system with parameters listed at the top of the panel. A decreasing minimum 
frequency noticeably increases the accuracy of measuring the chirp mass and also has an important 
effect on determining the mass ratio but has a smaller effect on the measurement of the tidal deforma- 
bility. Right: Constraints on the tidal deformability for three different physical systems and different 
detector designs using an initial frequency of 6Hz. 


Naet ~ 6000). Moreover, we should also stress that the uncertainty estimations of Table 19 
might be underestimated themselves, because of the restrictive hypothesis of a continuous 
equation of state at any density. Nevertheless, these results give us an indicative impact of 
ET measurements on the radii uncertainties using a fairly general EoS, within the hypothesis 
of purely nucleonic degrees of freedom. 


6.2.2 Full parameter estimation results 


In addition to our estimates based on the Fisher matrix computations, we also perform full 
parameter estimation studies. 

For this analysis, we consider three different physical systems (A,B,C) corresponding to 
different input equations of state (APR4 and H4), mass ratios (symmetric or asymmetric), 
and spins (zero or non-zero). The values of those injection parameters can be seen on the 
different panels of Fig. 42, and on Tab. 20. The choice of the particular systems follows the 
injection study of [231] done by the LVK. 

For these studies, we use the relative binning technique [232-234] to reduce the com- 
putational cost of our analysis. Figure 42 shows some of our main findings, and we refer 
to [235] for more details. We analyze signals injected in zero noise, to avoid the impact of 
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Name Injection model | mi, m» [Mo] | Me [Mo] Ay, Ag A X1, X2 


Source A APRA 1.68, 1.13 1.19479 77, 973 303 0,0 
Source B APRA 1.38, 1.37 1.19700 275, 309 292 | 0.02, 0.03 
Source C HA 1.38, 1.37 1.19700 | 1018, 1063 | 1040 0, 0 


Table 20: Source properties used for the full PE injections in Sec. 6.2.2. 


source Geometry Nsat Esat K sat Esym Lsym Ksym 


source-A | 2L 20km 0° | 0.164*0007 | —16.187041 | 228721 | 31.657275 | 40.9792, | —262781 
A 10km | 0.164805 | —16.147025 | 22927 | 31.594353 | 40.14153 | —269+33 


source-B | 2L 20km 0° | 0.627700" | —16.061037 | 227137 | 30.914387 | 42.04530 | —197 123 
A 10km | 0.163Ł8806 | -16.054033 | 29727 | 30.861334 | 40.9175 | —20811$3 


source-C | 2L 20km 0° | 0.15540:005 | —15.9440:34 | 236731 | 30.964354 TOS 102 | 7v% 
A 10km 10.156 00.5 | —15.9210%35 | 28615 | 30:99 205. | Oe | 74433 


Table 21: Values for the nuclear-matter parameters up to second order in units of MeV except for 
Nsat, Which is in units of fm^?. For ET, the two configurations leading to the smallest and largest 
uncertainties have been chosen; all other configurations give uncertainties in-between these. 


fluctuations from noise when analyzing different starting frequencies. T'he left panel shows 
the impact of varying the initial frequency on the recovery of the chirp mass, the mass ra- 
tio, and the tidal deformability. Overall the width of the 9096 confidence interval for the 
recovery of the chirp mass reduces by more than an order of magnitude when going from 
20 to 6Hz, and also the size of the 9096 confidence interval for the mass ratio shrinks by a 
factor of 2. However, the constraint on the tidal deformability reduces only by about 2596. 
Comparing the AlO0km and the 2L 15km setup, we find that the 2L configuration is overall 
better for measuring tidal deformability by about 25%, where this difference is more or less 
independent of the initial frequency. As shown in [235], this is due mainly to the different 
arm lengths. In the right panel of Fig. 42, we compare the A posteriors for three different 
physical systems, and five different detector configurations. In this case, we use an initial 
frequency fio, = 10 Hz for all the runs, and a noise simulated with the HFLF cryogenic 
sensitivity. We find that in general, the aligned or misaligned 2L detector configurations lead 
to similar results. The 2L configurations result in tighter bounds on A than the A10km one; 
as above, these improvements arise mainly from the different arm lengths. It is also worth 
pointing out that the error on A for the three selected sources, as reported in the right panel 
of Fig. 42, is consistent with that found by Fisher matrix calculations within a few percent, 
for the various detector configurations considered. 


6.2.3 Connected uncertainty of nuclear-physics parameters 


An immediate question is to what extent the different proposed configurations of the instru- 
ment can improve on our knowledge of nuclear matter and NS properties. In Section 6.2.1 
we have already discussed estimates for the uncertainties on the NS radius resulting from 
the different detector configurations. Here we will continue the discussion by analysing the 
full parameter estimation results for the three sources discussed above. As above, we em- 
ploy the prior on the EoS from present-day nuclear physics knowledge [230] as a baseline. 
Because of the nuclear physics constraints, a number of popular equations of state might be 


—81- 


Source Geometry | Riau, [km] 
source-A | 2L 20km 0? | 11.44*013 
A 10km 11.348 
source-B | 2L 20km 0° | 11.304037 
A 10km B up 
source-C | 2L 20km 0? 13.691007 
A 10km | 13.684909 


Table 22: Radii (km) for 1.4M neutron stars as obtained for different geometries and different 
sources from the full parameter estimation runs under the nucleonic hypothesis. Again, as in table 21, 
only the geometries leading to the smallest and largest uncertainties, respectively, are shown here. 
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Figure 43: Posterior distribution for NS mass-radius relation (left) and the nuclear symmetry 
energy as function of baryon number density (right) for source A with geometry 2L -15 km with 
aligned arms. For comparison, the results obtained from the nuclear prior are also shown. 


only marginally compatible with our nuclear physics informed prior. This is particularly the 
case of the APR4 model, that was never confronted to nuclear physics data. This can be 
seen from the NS mass-radius relation shown in Fig. 43 (left panel) for source A and the 2L 
geometry with 15 km arm length. In this figure, we can see that the APR model predicts too 
small radii for the heaviest neutron stars, with respect to our nuclear prior °°. This explains 
why the injection value is not precisely recovered in the posterior distribution. For this rea- 
son, the radius uncertainty estimation will not be reliable for the higher masses and is not 
analyzed in the following. In table 21 we summarise the posterior results for the different 
nuclear-matter parameters up to second order at saturation density with the three sources 
for the two configurations of the instrument leading to the smallest and largest uncertainty, 
respectively. These parameters represent the equilibrium density (n;4;,) and energy (Esat), 
the symmetry energy at saturation (Esym) and its slope (Lsym), and the nuclear matter com- 
pressibility in the isoscalar (Ksat) and isovector sector (/X,,,,). For the symmetry energy, 
the posterior as a function of density is shown in Fig. 43 (right panel) for source A and the 


??In this figure, in order to allow for slightly smaller radii, we have relaxed the 2 solar mass constraint for 
the nuclear prior which is applied in Fig.41 above. 
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2L geometry with 15 km arm length. The results indicate first of all that the injected prop- 
erties are reasonably well recovered, even if the input model is only marginally compatible 
with our nuclear physics informed prior. On the scale of this figure, the differences between 
the proposed configurations of the instrument are insignificant and the 2L geometry is only 
slightly favored concerning the constraints on the nuclear-matter parameters. 

In Table 22 we summarise the posterior results for the NS radius at the fiducial mass 
1.4 Mo. with the three sources for the two configurations of the instrument leading to the 
smallest and largest uncertainty, respectively. We can see that the uncertainties depend 
stronger on the injection model than on the configuration of the instrument. The absolute 
values are different from those obtained with the Fisher matrix approach, and reported in 
Tab. 17 and 19. This is essentially due to the fact that the results in Tab. 22 are obtained 
on single events with very high SNR, which thus result in tight constraints on the source 
parameters, while the ones in Sec. 6.2.1 are produced from runs on a full realistic population 
of sources, the vast majority of which have much higher statistical uncertainties, resulting in 
looser constraints on the EoS parameters. Also, we again stress that the results presented 
in Tab. 17 are obtained through the extremely simplified metric in Eq. 6.6, and should thus 
only be considered as indicative for the comparison of the various detector configurations. 

To conclude, even if the quantitative uncertainties depend on many not fully controlled 
factors, from the NS mass-radius posterior shown in Fig. 43 (left panel), the improvement with 
respect to current knowledge by Einstein Telescope is clearly seen. In addition, the present 
analysis assumes at each time one detected source, i.e., an inspiral with two given masses, 
whereas the Einstein Telescope will detect tens of thousands of different BNS merger events 
with a distribution of NS mass. This staggering will considerably reduce the uncertainty on 
the density dependence of the dense matter EoS and hence the nuclear-matter parameters 
and the information on the NS radius; see also the discussion in Section 6.2.1. The impact of 
the instrument geometry on these conclusions is only marginal; as above, the results indicate 
that the 2L geometry is favored, but the difference with the other geometries is not significant. 


6.2.4 Postmerger detectability 


A BNS system typically merges at a frequency of ~ 1 kHz. After the merger, there are a 
number of physical processes, such as thermal effects, turbulences, phase transitions, mag- 
netohydrodynamical instabilities, dissipative processes etc. [212, 236—241], which drive the 
postmerger physics leading to different remnants, depending on the masses of the neutron 
stars and EOS of the system. The merger remnant can be either a black hole, a hypermas- 
sive NS (HMNS), a supramassive NS (SMNS) or a massive NS. If the mass of the merger 
remnant is above a threshold mass it will collapse to a BH [242-244]. Otherwise, there is 
a post-merger gravitational-wave signal that can last over a time scale of about 10ms up 
to several 100ms [245] emitted at frequencies of ~ 2 — 4 kHz. While the postmerger signal 
from a BNS merger is not expected to be detectable for 2G detectors due to their insufficient 
sensitivity at high frequencies, it will be accessible to 3G detectors. Due to the complicated 
physical effects involved, postmerger signals are most accurately represented by numerical 
relativity (NR) simulations. For this study we will use NR waveforms [219, 246—249]. 

To determine the detectability of the postmerger signal with different detector configura- 
tions, we will compute and compare the SNR from the postmerger signal for a set of NR BNS 
waveforms as seen by the various configurations considered in this study. The postmerger 
SNR is defined as the SNR of the signal from the merger frequency [250] of the BNS system, 
up to 8192 Hz. We use a set of 6 BNS-NR waveforms from the SACRA catalogue [251], the 
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massl | mass2 | Aj A» 
1.35 1.35 1211 | 1211 

1.46 | 3196 | 535 

1.56 | 1415 | 238 


Configuration name 
15H. 135.135. 00155 
125H. 107.146.0015 1.07 

H.117.156.00155 1.17 


H.135.135.00155 1.35 1.35 607 | 607 
1.21 1.51 | 1621 | 435 


125H.121.151.00155 
H_118_155_00155 1.18 1.55 | 1354 | 249 


Table 23: Table of the BNS-NR waveforms and their physical parameters used to compute the 


postmerger SNR. 
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Figure 44: Left: Averaged postmerger SNR for the 15H_135_135_00155 system for different detector 
configuration and PSD combinations. Right: Averaged postmerger SNR for all the BNS-NR systems 
as given in Table 23 for different detector configuration and HLFL cryo PSD. Each BNS-NR system 


is plotted with a different color with the detector configuration given on the z-axis. 


masses and tidal deformabilities of which are given in Table 23. For each detector geometry 


and PSD configuration, and each NR waveform, we use the same random set of 60 (ra, dec, 
V) values and quote the averaged postmerger SNR (8X8) for each of the NR waveforms. 
AV8 for all the detector geometry — PSD combinations. 


The left panel of Fig. 44 shows ppm 
for the system 15H_135_135_00155. As the high-frequency sensitivity of the HFLF cryo and 


HF-only PSDs are quite similar, the postmerger SNRs for these two PSDs are almost equal. 
We see the majority of variation occurring with varying arm-length for a given detector 
shape and changing detector shapes (triangular vs. 2L configurations). The maximum pA 
for this system is seen by the 2L 45? 20km HLFL configuration which is ~ 20% larger than 


the minimum value, which corresponds the A 10km HF configuration. 
The right panel of Fig. 44 shows a for all the BNS-NR systems for the different 


detector configurations and only the HFLF cryo PSD. For all the BNS-NR systems, 2L 20km 


45? gives the highest postmerger SNR. For pA there is not much difference between the 


SNR for 2L 0° and 45? (~ 0.7 96) irrespective of the arm length. Going from 15km 2L to 
Av8 Between A 10 km and A 15 km, we see ~ 11% 


20km 2L gives a ~ 6% increase in ppm’. 


— BA 


increase. 

These results show that the performances of the different geometries are relatively similar 
within a 10 — 2096 level, with 2L 45? 20km detector configuration being the best for detecting 
postmerger signals. 


6.2.5 Conclusions: nuclear physics with ET 


'The Einstein Telescope will significantly advance our ability to constrain fundamental nuclear- 
physics properties, going well beyond the capabilities with any existing observatory or exper- 
iment, and providing complementary constraints to explore large domains of the QCD phase 
diagram. For example, the upgrade of the FAIR at the GSI Helmholtz Centre for Heavy Ion 
Research will probe matter which is only slightly neutron-rich and at lower baryon densities 
and higher temperatures than what is expected for ET, which can reach the most extreme 
densities. Furthermore, the X-ray observatory NICER and proposed future instruments, 
such as eXTP, Strobe-X or ATHENA will be able to improve on current knowledge of cold 
dense matter by observing more pulsars and increasing the length of the observational data 
period and thus obtain NS radii with a precision of a few percent, similar to the antici- 
pated improvements with the existing advanced LIGO and advanced Virgo detectors (even 
in combination with KAGRA and LIGO India) or proposed upgrades such as LIGO A# and 
Virgo.next. ET, in addition to being able to determine NS radii with a statistical precision 
of sub-percent order due to the immense statistics with 10? mergers expected per year, will 
also be the only instrument able to add to the zero temperature f-equilibrated NS matter 
the information on the hot ultra-dense matter from the postmerger phase. With regards to 
the payoffs for nuclear physics, there is no significant difference between the different detector 
configurations, with longer arm-lengths leading to slightly better results. 


6.3 Population studies 
6.3.1 Merger rate reconstruction 


Since ET will have a long distance range, it will see binary coalescences out to cosmological 
distances. Therefore, it will probe the merger rate up to high redshifts. Knowing how the 
BBHs are distributed in the Universe would help constrain their various formation channels, 
leading to a better understanding of how such binaries form [54, 252-255]. Different models 
exist, often assuming that the merger rate follows the star formation rate with some additional 
contribution coming from primordial black holes. Models differ in the details but most of 
them agree on the general shape of the distribution: the merger rate increases up to a redshift 
between 1 and 2 before decaying for higher redshifts (see [252, 255] for examples). The merger 
rate as a function of the redshift is written as [256] 


dR dV 
T= Pt. (6.7) 


where V (z) is the comoving volume up to redshift z and 


T px(zp(z, ta)) 
p(z) ox p» eee did (6.8) 


is the binary formation rate, where p, is the star formation rate, z is the redshift of interest, 
zy is the redshift at which the binary is formed, tg is the time delay between the formation 
of the binary and its merger, and P(t4) is the probability to have a given time delay. 


— 85 — 


injected 
reconstructed, A 10km 
reconstructed, 2L 45? 15km 


35000 4 
$ 


30000 1 


0 
1.0 2.0 3.0 4.0 5.0 6.0 7.0 


ON A 10km 
ONA 10km 


ON 10km 
ÔN2L 09 15km 
H 
il 
i 

i 

i 

il 

4 

i 

i 

i! 
Hi 
i 
l 
il 
i 
i! 
IH 
M 
IH 
I 
i 
i! 
L| 


ON A 10km 
à Nor go 20km 
= 
e 
it 
| 
i 
i 
i 
i 
I 
-— 
Hi 
I. 
Li 
i 
il 
d 
i 
il 
1 
Li 
il 
1 
i 
E 
i 
il 
tp 
ii 
i 
il 
* 
i 
[ 
co 


ON A 10km 
Ó Nor 459 15km 
SN 
ce 
H 
I 
i! 
i! 
a 
i 2 
H 
i! 
He— 
l 
1 
L| 
4 
I 
I 
w 
A 
ot 
aD 
~ 


| 
Cane 
l 
j 
| 
H 
| 
| 
| 
hii 
l 
i 
l 
E 
44 4 
T 
= 
rj 
H | 
rj 
L3 
i 
Li 
i 
N 
2] 
!] 
Irt- 
l 


Nor, 459 20km 
r] 
[ 
[ 
1 
[ 
[ 
| 
1 
[ 
[ 
1 
I 
[ 
i 
i 
1 
1 
[ 
1 
[ 
[ 
[ 
i 
I 
[ 
1 
ot 
aD 
E 


ON 10km 


Figure 45: Top: Representation of the reconstructed merger rate with the 90% confidence interval 
for 10 years of observation of ET in a triangular shape with 10km arms (orange) and ET with 2 
L-shaped detectors with 15km arms at 45? (green). The blue line represents the true merger rate 
distribution. The two configurations enable one to reconstruct the merger rate but the accuracy is 
better for the 2 L-shaped detectors. Bottom: Ratio of the width of the 9096 confidence interval for 
the triangular 10km configuration with the other possible configuration. The red line corresponds to 
the case where the two detectors are equivalent, while the blue line is the average ratio. The hierarchy 
to be seen in this figure is that the 2-L aligned detectors are better than the triangular ones, and the 
2L with arms at 45? are more accurate than the 2 L's with arms at 0?. For a given configuration, the 


increase in arm length gives a small improvement, mainly due to the increase in SNR. 
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To evaluate how well different configurations for ET would be able to identify the 
merger rate density, we follow a procedure similar to [257], with a theoretical population 
made of 10 years of BBHs as given by [255], and discussed in Section 3. Then, we start 
by assigning a random sky location to the binaries. We compute their (network) SNRs and 
keep only the events with an SNR higher than 9 for a given configuration. The others are 
considered non-detectable. These events are then used to estimate the detector's efficiency 
as a function of the redshift. For each event, we perform a Fisher matrix analysis to obtain 
a multivariate Gaussian likelihood and sample from it to have an approximation for the 
measurements [216]. For each detectable BBH, we make a Gaussian centered on the mean 
value of the recovered posterior (which corresponds roughly to the injected value since the 
result of a Fisher matrix is centered on the true value), with a width equal to the standard 
deviation of the luminosity distance posterior obtained from sampling the Fisher likelihood. 
We then draw a measured luminosity distance. This is done for all the detectable BBHs. 
Then, we convert the luminosity distances to redshifts and count the number of events that 
we have in the redshift bins. This gives an observed merger rate, which is then corrected 
by the efficiency to account for the number of mergers we theoretically expect to miss for 
each redshift bin. We repeat the drawing of measurements 250 times to get error bars on the 
observed merger rate. 

The top panel of Figure 45 shows the reconstructed merger rate for 10 years of detections 
with ET seen as a single triangle with 10 km arms versus ET as 2 L’s with 15km arms and a 
45? rotation between the two detectors. The other panels show the ratio in accuracy between 
the single triangle with 10 km arms and the other configurations considered in this work. 
Although the two configurations lead to a correct reconstruction of the merger rate, ome sees 
that the configuration made of 2 L-shaped detectors leads to a more accurate reconstruction 
as it has smaller error bars and closer central values. T'he increase in the ratio is such that 
a hierarchy can be established between the different configurations. 2 L's are better than a 
single triangle and L's rotated by 45? are better than aligned ones. Table 24 shows the mean 
ratio found between the detectors. The improvement from one detector configuration to the 
other is driven by a reduction in error on the luminosity distance. On the other hand, for a 
given configuration, 5 km longer arms are slightly better, which can be understood by the 
increased SNR of events. Nevertheless, it is still possible to have a good grasp of the merger 
rate density for all the configurations within 10 years of observation. This should be quite 
helpful in the quest to reveal the formation channels for BBH mergers. 


6.3.2 Constraints on PBHs from high-redshift mergers 


Primordial black holes (PBHs) may form in the early universe and represent a dark matter 
(DM) candidate (see e.g. Ref. [258] for a review). Many constraints were set on their 
abundance in a wide range of masses [259] and recent GW detections suggest that, within 
standard scenarios, PBHs can only account for a subdominant DM component in the stellar 
mass range [119, 260—281]. Nonetheless, a discovery of a sub-population of PBHs would have 
fundamental consequences for our understanding of cosmology and particle physics (see e.g. 
[282-286]). 

One “smoking-gun” signature of the primordial scenario is the production of high red- 
shift mergers, which are unique targets of 3G detectors. The PBH model predicts a merger 
rate density that grows monotonically with redshift [264, 266, 268, 287, 288], with a power-law 
scaling of the form Rppu(z) œ [t(z)/t(z = 0)]| 94/27, where t(z) indicates the age of the uni- 
verse at redshift z. This rate evolution extends up to redshifts z > O(10?). On the contrary, 
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Other configuration Ratio width 90% confidence interval 


A 15 km 1.2 
2L0? 15 km 1.8 
2 L 0° 20km 2.0 
2 L 45° 15 km 2.9 
2 L 45° 20 km 3.3 


Table 24: Mean values for the ratios of the 90% confidence intervals in the merger rate reconstruction 
between ET triangle with 10 km arm-lengths and the other possible configuration envisaged in this 
work. 2 L’s are more accurate than one triangle, with L’s rotated by 45° the one with respect to 
the other being better. This happens because of the reduced error on the sky location. For a given 
configuration, increasing the arm length by 5km does not strongly increase the accuracy since the 
only source of improvement is the increased SNR for the observed events. 


even though large uncertainties are still present, astrophysical-origin mergers are generically 
not expected to occur at z = 30 in standard cosmologies [289] (see also [121—124, 290-303]). 

Recently, it was shown that a PBH population which may be compatible with a subset 
of the recent LVK observations would also lead to a large number of high-redshift detections 
at 3G experiments [277]. Observations of such distant events may be characterised by large 
measurement uncertainties on the inferred luminosity distance due to the low SNR [304— 
307]. However, focusing on constraining the merger rate evolution at redshift larger than 
O(10) may allow probing PBH populations up to abundances as low as fppy 7 1075 [308] in 
the solar mass range (even including the contamination of a population of Pop-III binaries), 
which would improve current capabilities by orders of magnitude. Therefore, the search for 
high-redshift primordial mergers is one of the scientific targets of the ET experiment. 

In order to test the performance of different configurations to search and constrain 
a PBH population, we consider the benchmark PBH model resulting from the analysis in 
Ref. [279]. The PBH model hyper-parameters are: [M., o] determining the PBH mass dis- 
tribution, modeled as a lognormal function with mean M, and width c; the abundance fppy 
normalised with respect to the dark matter; the redshift z,,,,4 characterising PBH accre- 
tion [309]. These are assumed to be given by the population inference result of Ref. [279], 
constraining the potential contribution from a PBH subpopulation to the GWTC-2 catalog 
and representing an upper bound on the PBH abundance in this mass range. In particular, 
we adopt Me = 34.54Mo, o = 0.41, fesu = 107864 and Zarot = 23.90, such that the PBH 
channel may be adequate in explaining around (1—21)% of the detections in the 01/02/03a 
run of LVC, and given the associated astrophysical models considered, the mass gap event 
GW190521. 

We compute the number of detectable PBH events, N44, by estimating the detector 
network selection bias based on an injection of N = 10° events at redshift z > 10. We 
randomise event directions and orientations, and we assume negligible PBH spins (compatible 
with the prediction of the standard scenario at high redshift [268, 309, 310]). The computation 
is performed using the IMRPhenomHM waveform model. We set a detection threshold 
SNR > 9. The minimum testable abundance is conservatively estimated by requiring at 
least one detectable event per year with z > 30 [277, 311]. This simplified approach provides 
an estimate of the ET capabilities which is consistent with actual population analyses, as 
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Configuration | Nae«(z > 10) [1/yr] | Na«(z > 30) [1/yr] | fep? "e? [x107?] 
A-10km 1140 77 2.61 
A-15km 1764 261 1.42 
2L-15km-0? 1597 238 1.48 
2L-15km-45° 1651 221 1.54 
2L-20km-0? 1984 434 1.10 
2L-20km-45? 2080 416 1.12 


Table 25: Capabilities for constraining high-redshift PBH mergers of the various ET geometries 
(all taken with their HFLFcryo ASD). From left to right, columns report the number of detectable 
events per year at redshift larger than 10 or 30 assuming the PBH population from Ref. [279]. The 
rightmost column reports the minimum testable PBH abundance, see more details in the main text. 
In bold we indicate the reference configuration. 


Configuration Naet(z > 10) [1/yr] | Na«(z > 30) [1/yr] | fogged (c1075] 
CE40km 1373 4T 3.34 
A-10km + CE40km 1940 180 1.71 
A-15km + CE40km 2276 372 1.19 
2L-15km-45° + CE40km 2210 333 1.26 
2L-20km-45° + CE40km 2476 522 1.00 


Table 26: Same as Table 25 for four ET geometries in a network configurations including one 40km 
Cosmic Explorer. ET is only taken here in its HFLFcryo configuration. 


Configuration Naet(z > 10) [1/yr] | Na«(z > 30) [1/yr] | feye [x107°] 
A-10km-HF 15 0.00 - 
A-15km-HF 85 0.00 - 
2L-15km-0°-HF 75 0.00 - 
2L-15km-45°-HF 69 0.00 - 
2L-20km-0°-HF 178 0.00 - 
2L-20km-45°-HF 170 0.00 - 


Table 27: Same as Table 25 for configurations with the HF-only ASD. 


performed in Ref. [308]. 

We report the result of the analysis for the various configurations in Tables 25, 26 and 
27, detailing how the detection prospects change as a function of the configuration geometry, 
the inclusion of a single 40 km CE detector in the network, and removal of the low-frequency 
instrument, respectively. In each table, we indicate the number of detectable events above 
redshift z > 10 and z > 30, alongside the estimated minimum testable abundance. The 
comparison of Tables 25 and 27 shows the important of the LF instrument for the observation 
of PBH at large redshifts. In particular, without the LF instrument, there are no detection 
at z > 30, for all the the geometries considered. 

The importance of the LF instrument when searching high redshift mergers can be 
intuitively understood by looking at Fig. 46, where we compare the signal amplitude for a 
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merger with source frame mass Mt% = 20Mo located close to the horizon (i.e. having SNR 
— 9 in the A-10km-HFLF-Cryo configuration). Due to the large source redshift, the detector 
frame mass is redshifted by a large factor, and the majority of SNR can only accumulate in 


the LF portion of the sensitivity curve. 


10719 


— f? 8,17? (10km — HFLF-Cryo) 
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Figure 46: Comparison between the amplitude of a PBH binary signal 2f|h(f)| and the ET detector 
sensitivity curves (fS,,)!/? shown in green (10km-HFLF-Cryo) or yellow (10km-HF). We consider a 
distant merger with source frame mass Mł% = 20M at z = 52. Also, we show a subsolar merger 


src 


with Mit = 1Mọo at z = 0.88. In both cases, we assume q = 1, optimal orientation of the binary and 
the redshift is fixed in order to have SNR=9 in the A-10km-HFLF-Cryo configuration. 


In order to fully exploit the high redshift smoking-gun signature of a PBH population 
of mergers, sufficient precision on the source distance is also necessary, to distinguish PBHs 
from potential Pop-III contaminants. The performance of ET on single-event-identification 
was studied in details Ref. [304, 307] adopting a Bayesian parameter estimation. A limited 
precision on the source redshift may also impact the ability to reconstruct the merger rate 
evolution at high redshift within a population analysis, as shown in Refs. [306, 308]. Based on 
a Fisher Matrix analysis, we compare the performance of different configurations in Fig. 47, 
which reports the maximum source redshift above which a relative precision on dr, better 
than 10% and 1% cannot be achieved. The relative error og, /dr, is highly impacted by the 
performance of the detector configurations at low frequencies and follows the general trend 
observed in the behaviour of the horizon (see Fig. 2) and number of detectable high redshift 
PBH events (i.e. Tables 25, 26 and 27). We refer to [312] for further details on the analysis. 

As far as the comparison of the performance of the various configurations is concerned, 
the results can be summarised as follows. i) The 10 km triangle is the one that performs less 
well. In its full HFLF-cryo configuration, for the assumed PBH population, the 2L-15km-45? 
configuration would detect about three times more events with z > 30 than the 10 km triangle. 
ij) A dramatic reduction in the number of high-redshift event is observed in the absence of 
the LF instrument, to the extent that no events are detected at redshifts z > 30, where a 
detection is identified as a PBH. This is also reflected in the behaviour of the uncertainties 
on the luminosity distance. This is due to the reduced capabilities at small frequencies, where 
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Figure 47: Contour lines of constant relative error on dr, as a function of source frame total 
mass Mo and source redshift z. Different colors indicate different configurations while each panel 
(from left to right) assume HFLF-Cryo and HF, respectively. The solid (dashed) lines corresponds to 
04; / dr = 1096 (1%). We assume equal masses m, = m» and optimal orientation of the binary. As a 
consequence of setting . = 0, aligned and misaligned 2L configurations result in the same performance 
for this kind of estimation. 


most of the high-z events are expected, as they are characterised by a large detector frame 
total mass. 


6.3.3 Other PBH signatures 


In this section, we analyse some other observables which may be crucial to discover/constrain 
the population of PBHs (see Ref. [305] for a systematic overview and [312] for further details 
on the present analysis). 

o Subsolar mass range. The distribution of PBH masses mpgg is determined by 
the characteristic size and statistical properties of the density perturbations, corresponding 
to curvature perturbations generated during the inflationary epoch. Within this standard 
formation scenario, Mppy is related to the mass contained in the cosmological horizon at the 
time of the collapse, and a wider range of masses is accessible compared to astrophysical 
BHs [313]. PBHs, in particular, can have subsolar masses, which are not expected from 
standard stellar evolution, while they can also populate the astrophysical mass gaps [286, 
314, 315]. For masses m; 2 3Mo PBHs should be distinguished from stellar-origin BHs by 
mass-spin measurements [305, 309, 316]. The range Mo € mi € 3Mo can be distinguished 
from NS thanks to tidal deformability measurements (see Section 6.1.3). Finally, the subsolar 
range represents a smoking-gun signature of the non-standard, i.e. potentially primordial, 
origin of a binary.?^? 

The Einstein Telescope will dramatically extend the reach of current technology when 
searching for mergers with at least one component below the Chandrasekhar limit [311, 319]. 


34See, however, Ref. [317] for models in which subsolar BHs are born out of dark sector interactions. Also, 
in this case, the detection of subsolar mergers would indicate new physics. 

5^While in the main text we focus on nearly equal mass mergers with q ~ 1, other potentially interesting 
systems for the detection of subsolar objects are intermediate mass ratio inspirals, given the high precision in 
measuring the mass of the light object at subpercent level [318]. Due to the relevance of the low-frequency 
range for these observations, the implementation of the LF interferometer could in principle enhance the 
capability of detecting these systems. 
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Figure 48: Detection horizons for non-spinning PBH binaries assuming a mass ratio q = 1 or 
q = 1/4, for the various detector configurations considered. This plot extends Fig. 2 in the sub-solar 
mass range, of interest for PBHs. 


In Fig. 48 we show the detection horizon in the subsolar mass range, extending Fig. 2 in the 
range of masses where only PBHs, white dwarfs, brown dwarfs, or exotic compact objects [184] 
(e.g. boson stars [320]) can appear. As a large portion of the SNR is built up in the HF range 
of the sensitivity curve, the horizons are only slightly dependent on the LF instrument. This 
feature is shown in Fig. 46, where we compare the signal amplitude for a subsolar merger 
with source frame mass M} = 1Mo located close to the horizon (i.e. having SNR = 9 in 
the A-10km-HFLF-Cryo configuration). 

In Fig. 49 we compare the performance of different ET configurations when measuring 
the source frame mass and distance of subsolar PBH mergers. In order to perform a proper 
comparison between different designs, for each mass m, we arbitrarily assume a source 
located at a distance fixed in order to have SNR= 30 in the A-10km HFLF-cryo configuration. 
We adopt the IMRPhenomHM waveform model and test both face-on (4 = 0) and edge-on 
(c = -/2) binaries. In this case we conclude that the loss of the LF sensitivity results only 
in a minor reduction of the uncertainty on mı and z. The length of the arms, which leads 
to a larger SNR, is the main factor responsible for the different performance of the various 
designs. 

o Eccentricity measurements: Another key prediction of the PBH model involves 
the eccentricity e. While formed with large eccentricity at high redshift [321, 322], PBH 
binaries then have enough time to circularize before the GW signal can enter the observation 
band of current and future detectors [305]. Therefore, PBH binary candidates must have 
small eccentricities. Let us mention here that, on a more general scope, the measurement 
of eccentricity may also inform us about the possible astrophysical formation pathway of a 
binary. While isolated formation channels predict small values of e in the observable range 
of frequencies of 3G detectors, dynamical channels predicts a fraction O(10%) with e > 0.1 
[54, 323-328]. This can be used to distinguish the origin of compact objects (see e.g. [329— 
332]). 

'To test the sensitivity of the various configurations to the measurement of a possible 
orbital eccentricity, we use the TaylorF2 inspiral-only waveform model [333-335] with the 
extension presented in [336] to account for a small eccentricity in the orbit. Using as reference 
values for the total source-frame mass Mio, = 2M and 20Mo, for each detector configura- 
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Figure 49: Relative error on the primary source frame mass (top) and source redshift (bottom) as 
a function of m in the subsolar range. We consider a source located at a distance such that the SNR 
— 30 in the A-10km HFLF-Cryo configuration. 


tion, we compute the relative error that can be attained on equal mass, non spinning systems, 
as a function of the eccentricity eg defined at fo,««, = 10Hz. The distance to the source is 
fixed to dr, = 100 Mpc for the case Mio; = 2Mo and to dr, = 500 Mpc for the other. The 
results for the relative errors attainable on eg are reported in Fig. 50. We notice, in partic- 
ular, the great relevance of the LF instrument to perform eccentricity measurements. This 
can be traced to the fact that eccentricity gives larger effects during the inspiral (i.e. at low 
frequencies) since when going closer to a merger, a binary system tends to circularize, with 
€9 X oe 15 The range of values of eg shown on the horizontal axis of Fig. 50 corresponds 
to eccentricities that are too large for a PBH binary, that are expected to have eg ~ 10-9 
when they reach f = 10 Hz [305]. Therefore, in this range, if at the same time the relative 
error on eo for a given detection is sufficiently small, e.g. below the horizontal dashed line 
in the figure (which corresponds to Aeo/eo = 0.33), one would be able to exclude that this 
event has a primordial origin. 


The information in Figs. 48, 49 and 50 can be summarised as follows: i) among the 
considered geometries, the triangle is the less performing giving e.g., uncertainties on the 
estimates of redshift about 1.4 times larger as compared to the 2L configurations, with both 
the considered ASDs. ii) the LF instrument turns out to be of great relevance for the recon- 
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Figure 50: Relative error on the eccentricity eg at fo = 10Hz as a function of its injected value for 
a binary with a total source frame mass of Miot = 2Mo and dr, = 100 Mpc in the upper panel, while 
Mo = 20M and dr, = 500 Mpc in the lower panel. For the range of eccentricities in the horizontal 
axis (that are too large for PBH binaries), one is able to exclude (at 3c level) a PBH origin for events 
that fall below the horizontal black dashed line. 


struction of the eccentricity (with a gain in relative error that can be larger than one order 
of magnitude) and to a lesser extent also in the reconstruction of the masses and distances 
of sub-solar objects that, thanks to the LF sensitivity, can stay in the band for more cycles. 


6.4 Cosmology 
6.4.1 Hubble parameter and dark energy from joint GW/EM detections 


In the context of cosmological studies, a key property of compact binary coalescences is 
that, from their GW signal, one can reconstruct the luminosity distance to the source [337]. 
Compact binary coalescences are then typically called ‘standard sirens’, by analogy with the 
‘standard candles’ that provide absolute cosmological distance measurements from electro- 
magnetic observations. Since the GW signal does not provide a direct measurement of the 
redshift, the ideal situation to constrain cosmological parameters is to measure the redshift 
of the host galaxy of the gravitational-wave source. Due to the poor localization capability 
of the gravitational-wave detectors, the host galaxy is typically identified only if we detect 
an electromagnetic counterpart (with some possible exceptions, see Section 6.4.3). There 
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are, however, a variety of statistical methods, based either on spatial correlations with galax- 
ies or large-scale structures, or on features in the mass distribution of the sources, that 
can be used to extract cosmological information even in the absence of an electromagnetic 
counterpart [9, 338-366]. Standard sirens, either with counterpart or combined to statistical 
methods, allow us to measure the Hubble parameter Ho at small redshifts while, at the larger 
redshifts that become accessible to a 3G detector, we can also explore the dark energy (DE) 
sector, through the study of the DE equation of state and of modified GW propagation. 


In this section we explore the relative potential of different ET configurations for cos- 
mology, limiting ourselves to the study of GW sources with an electromagnetic counterpart, 
while in Sections 6.4.2 and 6.4.3 we will discuss alternative possibilities. For this purpose, 
among the instruments discussed in Section 4.2, the most useful are those which enable to 
obtain precise (spectroscopic) measurement of the host galaxy redshift. In particular, follow- 
ing the analysis in [367], we consider joint GW/EM detections between ET and THESEUS. 
We consider this instrument, for the detection of the associated short GRB, since the local- 
isation capabilities (~ arcmin/arcsec) of both the XGIS and SXI instruments of THESEUS 
facilitate the follow-up by ground-based telescopes, guaranteeing a high chance of obtaining 
the redshift determination (see the discussion in Section 4.2). We consider only the prompt 
emission (see Tables 6 and 7 for the number of joint detection per year). We further consider 
the joint detections between ET and VRO, which is able to observe the kilonova emission 
produced by BNS mergers in the optical band and give a precise localization for accurate 
redshift measurements. As discussed in Section 4, this allows us to study multi-messenger 
prospects for ET both at the high redhsifts probed by GRB observations and at z 3; (0.3—0.4), 
where it will be possible to detect kilonova emission using wide field telescopes. 


We stress that the absolute estimates for the accuracy that we will obtain below for Ho, 
as well as for the parameters describing the DE equation of state and modified GW prop- 
agation introduced in the following, should not be understood as forecasts for the overall 
capability of ET for measuring these cosmological parameters. We are focusing here only on 
standard sirens observed from two specific instruments (in this case, THESEUS and VRO). 
Apart from the fact that the results also rely on assumptions on their performances and spe- 
cific observational strategies, other EM observatories are expected to operate together with 
ET, increasing the sample of detections and/or adding further complementary information. 
On top of this, ‘dark sirens’, i.e. coalescing binaries without an observed electromagnetic 
counterpart, can provide very significant information through various statistical techniques, 
as mentioned above. In particular, the correlation with galaxy catalogs already provides 
valuable cosmological information at the level of current LVC data: the statistical method 
has been applied to extract Ho from the recent LIGO/Virgo detections in a number of pa- 
pers [9, 354, 359, 368-371], and to studies of the DE sector in [354, 360, 372, 373]. These 
studies in the ET era will greatly benefit from Euclid and the Large Synoptic Survey Tele- 
scope (LSST) surveys by the VRO which will map billion of galaxies giving more precise 
photometric redshifts, and spectroscopic surveys by multi-object survey spectrographs such 
as WAVE or 4MOST. Furthermore, several other ideas have been put forward to extract cos- 
mological information from ET, see e.g. [343-345, 350, 351, 357, 358, 372, 374]. Therefore, 
an assessment of the full capabilities of ET for estimating cosmological parameters must take 
into account all these opportunities. Here, we focus on two examples, the joint GW/EM 
detections by ET+-THESEUS and by ET+VRO, with the purpose of evaluating the relative 
performances of different configurations of ET in these specific situations, rather than the 
absolute precision to which cosmological parameters can be reconstructed at ET. 
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For the ET-- THESEUS analysis, the population of BNSs and the associated EM signal 
at high energies are generated following the approach in ref. [18]. Following the same approach 
adopted in Section 4, we fix the EM signal for each BNS of the population, for the entire 
set of simulations corresponding to the different E'T configurations and geometries. As a 
reference, we consider 5 years of observations, and we select from our simulation a sample 
of events, such that among them there are 75 joint detections with the configuration 2L 
20 km HFLF cryo (see Table 6). Considering the same BNS sample, we then compute the 
number of joint detections for all the other configurations. With this method, we are correctly 
evaluating the capabilities of the ET configurations considering all of them to have the same 
total observational time, the same BNS sample and the same EM signal. In this derivation, 
the duty cycle of each ET configuration is taken into account. 


We have performed parameter estimation using both GWFISH [31] and GWFAST [17, 32], 
finding full agreement. A delicate point of the parameter estimation procedure is that the 
events that give a detectable GRB are close to face-on, where the degeneracy between the 
luminosity distance dr, and the inclination angle ; results in large errors on dr, and often 
unreliable inversion of the Fisher matrices. This can be cured by imposing a prior on 1, 
which reflects the information that, if a jet was observed from cosmological distances, the 
binary was close to face-on. A full analysis of the corresponding selection effects is quite 
non-trivial, and is in progress. For this study, we observe that all the GRBs of our catalog 
have || < (5 — 10) deg.?6 We also keep fixed the position of the source in the sky, having 
assumed that the host galaxy has been identified. We then use the simple (and common) 
prescription of setting Adr /dr = 2/SNR. This expression, with a factor of two included 
to be conservative, corresponds to the large SNR limit of the Fisher-matrix analysis when 
all other parameters are fixed. Since ų¿ and the position in the sky are the parameters that 
are most degenerate with dz, once these are kept fixed or strongly constrained by priors, 
this limiting value can become a reasonable approximation. A more detailed study on this 
is in progress. We then add in quadrature an error Adz/dr = 0.05z due to lensing, as in 
[375, 376] (see also [377] for more recent justification). From the value of z of a joint GW+EM 
event of our catalog, we then compute the nominal value of its luminosity distance using a 
fiducial ACDM model (we use the fiducial values Ho = 67.66 km s^! Mpc ^! and Qm = 0.31, 
from Planck-2018 [378]), and the error Adr, from the above procedure and then, in order to 
simulate the realistic setting of an experiment, we scatter the ‘observed’ value of dr, using a 
Gaussian distribution with the given Adz. 


In a similar way, we study the capabilities of the synergy between ET and VRO. The 
population of sources and their respective KN emission are generated through the procedure 
described in Section 4. In this case, the number of events considered corresponds to one year 
of observations, rather than 5, due to the larger number of potential joint detections (see 
Section 4 and Table 10 for details). We consider the observational strategy consisting on 
following-up all the events localized with A99, < 40 deg?, two filter (g and i) observations 
repeated the first and second night after the merger and an exposure time for each pointing 
of 600 s. In particular, for each configuration we consider a number of sources matching the 
ones reported in column 3 of Table 10. Note that, thanks to its isotropic nature, the KN 
emission can be observed irrespectively of the binary inclination, resulting in a sample of 
events with better conditioning for the Fisher matrix analysis, thanks to a milder degeneracy 
between v and dy. 


36 More precisely, to take into account the periodicity of 1, they have min (+, 180? — 1) < (5 — 10) deg. 
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Figure 51: Reconstruction of the parameters Ho and Qm in ACDM, from the joint GW+EM events 
obtained with ET+THESEUS in 5 yr of observations, for the different geometries of ET shown, all 
with their HFLF-cryo sensitivity. 


We now present the results obtained for (Ho, (5,0), for the DE equation of state, and 
for modified GW propagation. 


Hubble constant. In a flat ACDM model, the relation between the luminosity distance 
dr, and the redshift is 


di(z) = (6.9) 


— (1+2) jf 
v 9u( 1-42)? F” 


where Qm is the present matter fraction and Q4 is the energy density fraction associated with 
the cosmological constant (we neglect the contribution from radiation, which is completely 
negligible at the redshifts relevant for standard sirens. In this approximation Q4 = 1 — Oy 
for a flat ACDM cosmology). At low redshift this reduces to Hubble's law dz(z) ~ (c/ Ho)z, 
so standard sirens at low redshift can allow us to measure Ho. 

The first measurement of Ho from a standard siren with counterpart has been obtained 
from GW170817 [379]. However, the error from this single detection is still too large to 
discriminate between the value of Hp obtained from late-Universe probes [380, 381], and 
that inferred from early-Universe probes assuming ACDM [382, 383], which are currently in 
disagreement at 5.30 level. 

Our results for Hp and Qm are shown in Fig. 51 and Table 28 for ET+THESEUS (that 
shows the result of a typical realization of the catalog of joint detections, among several 
that we have generated), and in Fig. 52 and Table 29 for ET--VRO. We should stress that 
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Configuration AHo/ Ho Ay /QqM 
A-10km 0.057 0.546 
A-15km 0.035 0.290 
2L-15km-45° 0.040 0.370 
2L-20km-45° 0.029 0.276 


HFLF cryogenic 
Configuration | AHo/Ho | AQy/Qu 
A-10km 0.009 0.832 
A-15km 0.007 0.303 
2L-15km-45° 0.006 0.370 
2L-20km-45° 0.004 0.243 


Qin, 0 


Figure 52: Reconstruction of the parameters Ho and Qm in ACDM, from the joint GW+EM events 
obtained with ET+VRO in 1 yr of observations, for the different geometries of ET shown, in the left 
panel with their HFLF-cryo sensitivity and in the right panel with the HF instrument only. 


Table 28: Relative errors on Ho and Qm in ACDM (median and symmetric 68% CI), from the 
joint GW+EM events obtained with ET+THESEUS, for the different geometries of ET shown, all 
with their HFLF-cryo sensitivity. We stress that no prior from electromagnetic observations, such as 
CMB+BAO-+SNe, is used here; with such priors, the accuracy on Ho becomes sub-percent. 


HF only 
Configuration | AHo/Ho | AQ m/m 
A-10km 0.065 1.23 
A-15km 0.057 1.86 
2L-15km-45? 0.066 1.31 
2L-20km-45? 0.031 1.22 
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Table 29: Relative errors on Ho and Qm in ACDM (median and symmetric 68% CI), from the 
joint GW+EM events obtained with ET+VRO in 1 yr of observations, for the different geometries 
of ET shown, in the left table with their HFLF-cryo sensitivity and in the right table with the HF 
instrument only. 


these are the results obtained from standard sirens only, without any prior from cosmological 
electromagnetic probes. Of course, CMB, BAO, SNe, structure formation, and other electro- 
magnetic probes provide much stronger constraints on Qm; assuming ACDM, Planck-2018 
fixes Qm = 0.315 + 0.007 [378], ie. AO /Q ~ 0.022, much smaller than what can be 
obtained from standard sirens alone. In principle, one could then combine cosmological and 
standard sirens data, as in [367], reducing significantly the error on Ho.?' In any case, for 
our present purpose of comparing the different geometries, Figs. 51 and 52 and Tables 28 
and 29 contain the information that we need, on the relative performances of the different 
ET configurations. 

We see that the best results for Ho are obtained by ET+VRO, whose detections are 
potentially more numerous and at small redshift, where the impact of Ho is bigger. However, 
we notice that the results obtained for THESEUS are very solid and conservative: they are 
based on the current observations of short GRBs and consider only the prompt emission 
(including the afterglow detections by XGIS and SXI will increase the detections per year 
by a factor of about 3). In contrast, the VRO estimates strongly depend on the BNS merger 
rate normalization (which, on the basis of the GW observations by the LIGO and Virgo 
detectors, can be one order of magnitude smaller or larger). 


Concerning the comparison of configurations we find that the triangle-15km and the 2L- 
15km-45° have quite similar accuracy on Ho for this test, and improve by a factor ~ (1.3—1.6) 
the result of the 10 km triangle, for ET+THESEUS and ET+VRO (with ET taken in both 
cases in the HFLF-cryo configuration). From the ET+VRO results, we also see the huge 
impact of the LF instrument, which allows us to observe a larger number of sources with good 
localization and results in constraints more than a factor of 7 tighter on Hg and more than 
a factor of 2 tighter om Nm o. 

Observe that the reduction of joint ET+VRO detections is dramatic if we lose the 
low-frequency: only 696 of the events detected by the triangle-10 km HFLF-cryo, and only 
1196 of those detected by the 2L-15km HFLF-cryo, are observed by the corresponding HF- 
only configurations, see columns 4 of Table 11 and Table 10. In contrast, for the joint 
ET+THESEUS detections the reduction is much smaller: 6096 of the events detected by the 
triangle-10 km HFLF-cryo and 70% of those detected by the 2L-15km HFLF-cryo are ob- 
served by the corresponding HF-only configurations, see columns 4 of Table 11 and Table 10. 
Taking into account that the detections are distributed in the same redshift range (see for 
example Fig. 23), the observations of THESEUS together with ET-HF become comparable 
to those obtained by THESEUS together with ET at full HFLF-cryo sensitivity, in a observa- 
tion time longer by a factor ~ 1.5 (which could also be partly compensated if the duty cycle 
of the HF-only instrument will be larger than that of the full HFLF-cryo configuration). 


Dark energy equation of state An accurate measurement of the Hubble parameter 
with GWs would be of course very interesting a priori. However, it is in principle possible 
(although by no means guaranteed) that, by the time that 3G detectors will be operative, 


3’The limits on Ho reported here, obtained from GWs+GRB only, are consistent with the results in [367], 
after taking into account that we are considering 5 years of data (which is more consistent with updated 
estimates for the expected duration of the THESEUS mission), while the data in [367] were relative to a 
10 yr period, and that we use a model for short GRBs normalized to the observations (which takes into 
account the fact that not all BNS merger produce a GRB), resulting in a smaller number of joint GW--GRB 
detections. As shown in [367], if one further combines the constraints from joint GW+EM detection with 
purely electromagnetic observations from CMB, BAO, SNe, etc, the accuracy on Ho reaches the subpercent 
level. 
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Figure 53: Reconstruction of the DE equation of state parameters wo and Wa, from the joint 
GW-+EM events obtained with ET+'THESEUS in 5 yr of observations, for the different geometries of 
ET shown, all with their HFLF-cryo sensitivity. 


the discrepancy on Ho between late-Universe and early-Universe probes might have already 
found a resolution. More specific to 3G detectors is the possibility of testing the cosmological 
model with GWs up to moderate and large redshifts, where signatures of a dynamical dark 
energy could be found. 


In general, on cosmological scales, one performs a separation between the homogeneous 
Friedmann-Robertson-Walker (FRW) background, and scalar, vector and tensor perturba- 
tions over it; tensor perturbations, when they are well inside the cosmological horizon, are 
just GWs propagating on the FRW background. Whenever gravity is modified on cosmo- 
logical scales, both the background evolution and the dynamics in the sectors of scalar and 
tensor perturbations are modified (vector perturbations usually decay and are irrelevant). 
Here we discuss the effect of dark energy on the background evolution, which, as we will 
recall, can be described by an effective dark energy (DE) equation of state, while below we 
will discuss the modification of the tensor sector related to modified GW propagation. 

At the background level, the effect of a dynamical dark energy component is described 
by the DE density ppg(z) and by its pressure ppg(z). Equivalently, one can introduce the 
DE equation of state wpg(z), defined by ppg(z) = wpz(z)ppge(z). The standard ACDM 
model is recovered for wpg(z) = —1. Using the conservation of the DE energy-momentum 
tensor, one finds that the DE density is given as a function of redshift by 


ppE(z) = po€)pg exp ef n [1+ woel) } ; (6.10) 
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Configuration | Awọ | Awa 
A-10km 0.49 | 3.81 
A-15km 0.40 | 2.65 
2L-15km-45? | 0.35 | 2.55 
2L-20km-45? 0.34 | 2.40 


Table 30: Absolute errors on the DE equation of state parameters wg and wa (symmetric 68% 
CI), from the joint GW+EM events obtained with ET+THESEUS, for the different geometries of ET 
shown, all with their HFLF-cryo sensitivity. 
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Figure 54: Reconstruction of the DE equation of state parameters wo and Wa, from the joint 
GW-+EM events obtained with ET+VRO in 1 yr of observations, for the different geometries of 
ET shown, in the left panel with their HFLF-cryo sensitivity and in the right panel with the HF 
instrument only. 


where Qpe = ppe(0)/po is the DE density fraction and po = 3H?/(87G) is the critical 
density. At the background level, the properties of DE are therefore encoded just in one 
function wpg(z). The corresponding expression for the luminosity distance (neglecting again 
radiation) is 


c 7 dz 
iae Zara f - T (6.11) 
Ho 0 VYOQu(1 + 2)? + pog(2)/po 
and reduces to eq. (6.9) when ppg(Z) is a constant, i.e. when wpp(z) = —1. Any redshift 
dependence of ppg, i.e. any deviation of the DE equation of state from the ACDM value 
WpE = —1, would provide evidence for a dynamical dark energy. 


In general, it is difficult to extract from the data a full function of redshift, such as 
wpg(z), and a parametrization in terms of a small number of parameters is useful. For the 
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HFLF cryogenic HF only 
Configuration | Awo | Awa Configuration | Awg | Awa 
A-10km 0.31 | 3.92 A-10km 0.85 | 4.26 
A-15km 0.23 | 3.69 A-15km 0.56 | 3.84 
2L-15km-45° 0.14 | 2.49 2L-15km-45? 0.37 | 4.13 
2L-20km-45? 0.13 | 2.89 2L-20km-45? 0.27 | 3.52 


Table 31: Absolute errors on the DE equation of state parameters wo and wa (symmetric 68% CI), 
from the joint GW+EM events obtained with ET+VRO, for the different geometries of ET shown, 
in the left table with their HFLF-cryo sensitivity and in the right table with the HF instrument only. 


DE equation of state a standard choice is the (wo, wa) parametrization [384, 385], 


z 

wpE(z) = wo + mo“ . (6.12) 
Combining eq. (6.11) with eqs. (6.10) and (6.12), from the measurements of the luminosity 
distance and redshift of an ensemble of sources we can perform the inference on the cosmolog- 
ical parameters Ho, QM, wo, Wa. The constraining power of GWs alone is too weak to provide 
useful information on all these parameters (as we already saw just for (Ho, Qm) in Table 28), 
so one is obliged to combine standard sirens with other cosmological probes. In this context, 
it is then common to assume that Ho, Qm are simply fixed, or strongly constrained by pri- 
ors obtained from electromagnetic probes, and keep only (wo, wa) in the inference. Several 
forecasts of (wo, Wa) for ET already exist in the literature [367, 375, 376, 386]. Here we 
present updated results using our catalog of joint ET+THESEUS detections, for the vari- 
ous ET geometries considered, as well as ET+VRO for the various geometries and the two 
ASDs (as we saw above, in the case of ET+THESEUS, the loss of the LF instrument is 
much less dramatic, and basically can be compensated by increasing by a factor ~ 1.5 the 
observation time, which does not affect the relative performances of the geometries). The 
results are shown in Fig. 53 and Table 30 for ET+THESEUS and in Fig. 54 and Table 31 
for ET--VRO.?* 

For ET+THESEUS, we see that the differences in performances are not large between 
the various geometries: the 2L 15km 45? and the 2L 20km 45? are quite comparable and 
slightly better than the 15-km triangle, while the 10 km triangle performs less well (e.g. by a 
factor 1.4 on wo, with respect to the 2L 15km 45°). For ET+VRO the situation is similar, 
with slightly larger differences (the 10 km triangle performs less well, with respect to the 
2L 15km 45°, by a factor 2.2 on wo). We also notice again the improvement brought by the 
LF instrument for the VRO+ET case, with results tighter on wo by a factor ~ (2.4 — 2.7), 
depending on the geometry. 


Modified GW propagation As we have discussed, the DE equation of state characterizes 
the impact of dark energy on the background evolution and, as such, can also be studied 
with purely electromagnetic probes. Current Planck-2018 analysis from CMB, combined 
with SNe and BAO, including both wp and wa in the inference, find wg = —0.961 + 0.077 
and Wa = —0.28*03. (6896 c.l.) [378]. While GW observation can provide an independent 


38'T'he results for ET--THESEUS, obtained with GWFAST, have been double-checked using TiDoFM, finding 
broad agreement. 
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probe of these quantities, it is not evident that even 3G detectors will be able to improve 
quantitatively on these accuracies. As we see from Tables 30 and 31, this is certainly not the 
case for the specific example of joint ET+-THESEUS or ET+VRO observations that we are 
considering, where we are well above the ~ 7% accuracy on wo that we already have from 
CMB+BAO+SNe. For instance (recalling that in ACDM wo = —1, so the absolute errors 
on wo reported in Tables 31 and 30 are the same as the relative errors), for ET--VRO the 
configuration 2L-15km-45? reaches an accuracy on wo of 14%, while the 10 km triangle of 
31%. 

As emphasized in [386, 387], in the tensor sector the situation is different, and potentially 
much more interesting, for two reasons: (1) Modified GW propagation can only be tested 
from GW observations at large redshift, and therefore current limits (see below) are much 
broader. (2) There exist phenomenologically viable and theoretically motivated modified 
gravity models [388] that comply with all observational bounds at the background level and 
in the scalar perturbation sector (where, therefore, they differ from ACDM by at most a few 
percent level) and, nevertheless, in the tensor sector predict deviations from GR and ACDM 
that could be as large as 80% [389, 390]. 

The effect is due to a modification of the equation that governs the propagation of tensor 
perturbations over FRW. In GR, this is given by 


h^, +2Hh + ekha — 0, (6.13) 


where ha(n,k) is the Fourier-transformed GW amplitude, A = +, x labels the two GW 
polarizations, the prime denotes the derivative with respect to cosmic time n [which is defined 
by dn = dt/a(t)], a(7) is the FRW scale factor, and H = a'//a. Whenever gravity is modified 
on cosmological scales, unavoidably this propagation equation is also modified, as has been 
seen on many explicit examples [389-395]. A change in the coefficient of the k?h4 term 
induces a speed of GWSs, cg, different from that of light. After the observation of GW170817, 
this is now excluded at a level |cgw — c|/c < O(10 ^19) [3]. However, the modified gravity 
models that pass this constraint still, in general, induce a change in the 'friction term', so 
the propagation equation for tensor modes becomes, 


h^ + Zu —ó(m)]h4 + ekha — 0, (6.14) 


for some function 6(7) that encodes the modifications from GR. In GR, using eq. (6.13), one 
finds that the GW amplitude decreases over cosmological distances as the inverse of the FRW 
scale factor. From this one can show that, after propagation from the source to the observer, 
the amplitude of the GW from a compact binary coalescence is proportional to the inverse 
of the luminosity distance to the source (see, e.g. Section 4.1.4 of [396]). This is at the origin 
of the fact that compact binaries are standard sirens, i.e. that the luminosity distance of the 
source can be extracted from their gravitational signal. However, when the GW propagation 
is rather governed by eq. (6.14), this result is affected. It can then be shown that the quantity 
extracted from GW observations is no longer the standard luminosity distance dr (z) of the 
source [that, in this context, we will denote by d;™(z), since this is the quantity that would 
be measured, for instance, using the electromagnetic signal from a counterpart]. Rather, the 
quantity extracted from GW observation is a ‘GW luminosity distance’ d7" (z) [387], related 
to d?" (z) by [386, 387] 


d£" (2) = dg" (2) epf- f Sa} , (6.15) 
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Figure 55: Reconstruction of the parameters £g and n in modified gravity, from the joint GW+EM 
events obtained with ET+-THESEUS in 5 yr of observations, for the different geometries of ET shown, 
all with their HFLF-cryo sensitivity. 


Configuration | AEg/Eo | An 
A-10km 0.071 2.87 
A-15km 0.040 3.01 
2L-15km-45° 0.049 2.91 
2L-20km-45° 0.030 3.12 


Table 32: Errors on the parameters =o and n that describe modified GW propagation (median and 
symmetric 68% CI), from the joint GW+EM events obtained with ET+THESEUS, for the different 
geometries of ET shown, all with their HFLF-cryo sensitivity. 


where the function 6 that appears in eq. (6.14) has now been written as a function of redshift. 

Similarly to what we have discussed for the DE equation of state, inference on a full 
function of redshift such as ó(z) is difficult (although it can be performed, to some extent, 
with the technique of Gaussian process reconstruction [397]), and it is convenient to introduce 
a parametrization. A very convenient choice, in terms of two parameters (Zo, n), has been 
proposed in [386]. Rather than parametrizing ó(z), it is simpler to parametrize directly the 
ratio d?” (z)/d?™(z) (which is also the directly observed quantity), in the form 

df(z 4 , 1-8 


This parametrization reproduces the fact that, at z = 0, d?" /df" = 1 since, as the distance 
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Figure 56: Reconstruction of the parameters £g and n in modified gravity, from the joint GW+EM 
events obtained with ET+VRO in 1 yr of observations, for the different geometries of ET shown, in 
the left panel with their HFLF-cryo sensitivity and in the right panel with the HF instrument only. 


HFLF cryogenic HF only 
Configuration | AEo/Zo | An Configuration | AEo/Zo | An 
A-10km 0.097 | 3.04 A-10km 0.617 | 2.80 
A-15km 0.081 | 2.95 A-15km 0.007 | 2.87 
2L-15km-45? | 0.038 | 2.95 2L-15km-45? | 0.108 | 2.88 
2L-20km-45* | 0.027 | 2.59 2L-20km-45? | 0.063 | 3.00 


Table 33: Errors on the parameters Eg and n that describe modified GW propagation (median 
and symmetric 68% CI), from the joint GW+EM events obtained with ET+VRO, for the different 
geometries of ET shown, in the left table with their HFLF-cryo sensitivity and in the right table with 
the HF instrument only. 


to the source goes to zero, there can be no effect from modified propagation. In the opposite 
limit of large redshifts, in contrast, eq. (6.16) predicts that d?" /df" approaches a constant 
value Zo. This is motivated by the fact that, in typical modified gravity models, the deviations 
from GR only appear in the recent cosmological epoch, so 6(z) goes to zero at large redshift, 
and the integral in eq. (6.15) saturates to a constant. The parametrization (6.16), interpolates 
between these two limiting behaviors, with a power-law determined by n, and GR corresponds 
to Eo = 1 (for any n). This simple parametrization has been shown to work remarkably 
well for practically all best-studied modified gravity models [395]. A particularly interesting 
example is given by the model proposed in [388] (see [390, 398] for reviews) that, while 
complying with all current cosmological constraints, predicts modified GW propagation with 
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a value of Eo that (depending on a single parameter related to the initial conditions of the 
cosmological evolution) can differ from the GR value Eg = 1 by an amount between a few 
percent up to a value Eo œ 1.8 [389, 390], that would correspond to a 80% deviation from 
GR. Limits on =p have already been obtained from the current set of LVC detections, either 
from GW170817 as a standard siren with counterpart [386], or using BBH dark sirens from 
the O1, O2 and O3a LIGO/Virgo run, and performing a correlation with galaxy catalogs. 
This results in the value Eg = 2.1773 [354].?? Using instead a joint cosmology-population 
analysis that exploits the mass scales in the BBH mass function [350, 372] and the GWTC-3 
catalog of detections [6] one finds the constraints £g = 1.2 + 0.7 [360]. 

Here we show the constraint on Zo that can be obtained from the joint set of GW 
and electromagnetic observations obtained from ET+THESEUS and ET+VRO. As we have 
stressed at the beginning of this section, the absolute estimates for the accuracy that we will 
present should not be understood as the forecast for the overall capability of ET for measuring 
these cosmological parameters, since we are studying the information that could be obtained 
from two examples, namely the joint detections of ET+THESEUS and ET+VRO, and, as 
discussed at the beginning of this section, there are other ways of obtaining information on 
Eo from the ET data. Again, here we are mainly interested in the relative performances of 
the different ET configurations on some concrete examples. 

Fig. 55 and Table 32 show the results for the parameters =ọ and n that characterize 
modified GW propagation from the ET+THESEUS joint detections, while Fig. 56, and Ta- 
ble 33 show the results for ET--V RO. The hierarchy of performances is similar to that for Ho, 
with the 2L 20 km providing the best results, followed by the 15 km triangle and the 2L 15 km 
with similar performances (with 2L-15 km better than triangle-15km for ET+VRO), and the 
10 km triangle providing the less good performances. For ET--VRO, the huge improvement 
as a consequence of the inclusion of the LF instrument is also apparent. 

Observe that, in view of the discussion above, the accuracy that can be obtained from 
Eo even from these two specific examples, which ranges between 3% and 10% depending on 
the geometry with the full sensitivity curve, is already very interesting in absolute terms, 
since current lo limits are at the 60% — 100% level, and furthermore there are models that 
predicts a potentially much higher signal. 


6.4.2. Hubble parameter and dark energy from BNS tidal deformability 


BNS have an intrinsic mass scale and can only exist in a narrow range of masses. This mass 
scale is imprinted in the tidal interaction between the component NSs. Therefore, if the 
nuclear EoS is known, one can determine the source-frame masses by a measurement of the 
tidal deformability. This, in turn, would allow the measurement of the redshift directly from 
a GW observation because it is the redshifted mass that is inferred from the point-particle 
approximation of the waveform. Such a method was first proposed in [343] and further 
explored in [400, 401]. A measurement of the Hubble constant using a known relationship 
between the tidal parameter and source-frame mass was explored in [346, 356, 364] while [363] 
showed that one can simultaneously estimate both the nuclear EoS and the Hubble constant 
using future observatories. A measurement of the dark energy EoS was explored in [352, 361]. 

In this section, we explore the potential of different ET configurations to constrain the 
expansion history of the Universe assuming that the nuclear EoS is known. It is found in [356] 


5?An even more stringent result is obtained if one accepts the tentative identification of the flare 
ZTF19abanrhr as the electromagnetic counterpart of the BBH coalescence GW190521. Then, the analysis in 
[354] gives Eo = 1.8793. (6896 c.l.). This is consistent with similar results obtained in [399]. 
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that up to a 1596 uncertainty in the knowledge of the EoS does not affect the measurement 
of the Hubble constant in a meaningful manner. We use the TaylorF2 waveform model 
augmented with the 5PN and 6PN tidal terms in the phase, terminating the signal at the 
ISCO frequency corresponding to the total mass of the binary. In Section. 6.1.3, it was found 
that the results using this model versus an inspiral-merger-ringdown (IMR) waveform gives 
similar results. Since the analysis in this section is along similar lines, we expect the same to 
hold here too. Additionally, we assume the DD-LZ1 EoS for the NS for reasons elucidated in 
Section 6.2.1. We fit the logarithm (base 10) of the tidal deformability as a function of the 
mass of the NS using a fourth-order polynomial given by 


logio A(m) = —0.3550 m^ + 2.162 m? — 4.652 m? + 2.514 m + 3.892, (6.17) 


where m is in units of Mo. We verify that the fit reproduces the slope of the curve accurately 
with maximum errors at a few percent. This is crucial because it is the slope of the curve 
that contributes to the Fisher errors on the redshift. 

The Fisher errors from the dr-z space are then propagated to the space of cosmological 
parameters, d, via another Fisher matrix given by [402] 


_ 1 dk (z) adh (2) 
Gij 2. M "m Op) , (6.18) 


where N is the total number of observations in the catalog and Ci, , is the total luminosity 
distance error for the k-th event given by 


(ca, = (o4, + (%,)?- (6.19) 


Here, e is the contribution to the luminosity distance error due to the error in the GW 
amplitude while oj, is that due to the error in the redshift measurement, given by 


,  |8D, 
Era 


Oz. (6.20) 


In writing eq. (6.18), we have neglected the correlations in the dr.—z space for simplicity. 


Hubble constant. The results for Hp and Qum are shown in Fig. 57 and Tab. 34. A 
description of ACDM cosmology is given in Section 6.4.1. We see that a single year of 
observing run can achieve sub-percent errors on Ho with ET alone, albeit the errors improve 
by an order of magnitude when ET operates in a network with 2CE. The ET configuration 
with two 20km L-shaped detectors at 45? gives the best constraints. Similar trends follow 
for the dark matter energy density parameter, Qm, which can be constrained at percent level 
precision, except for the case of a standalone 10km triangular ET which can only achieve an 
accuracy of ~ 1096. 


Dark energy equation of state. The results for the dark energy equation of state param- 
eters are shown in Fig. 58 and the 1c errors tabulated in Tab. 35. A description of the dark 
energy model considered here is in Section 6.4.1. We see that the 2L 20km at 45? obtains 
the best constraints on the parameters, albeit the EoS parameter wa is not constrained by 
a standalone ET if one marginalises over the ACDM parameters. In such a scenario, it is 
common to fix the ACDM parameters to that measured with EM probes and measure only 
the dark energy parameters using GWs. In this case, except for the single 10km triangular 
ET configuration, the EoS parameters can be constrained. 
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Figure 57: Constraints on the parameters Hp and Qm in ACDM model using one year GW obser- 
vations from BNS alone for different ET geometries. The covariance for a standalone ET is shown 
in the left panel while ET in a network with 2CE is shown in the right panel. H is measured in 
kms~! Mpc !. 


Configuration AHo/ Ho AQOy /Oy Configuration AHo/ Ho AOy /Qu 
A-10km 9.63 x 107? | 1.10 x 1071 A-10km +2CE 7.35 x 1074 | 4,40 x 1073 
A-15km 7.20 x 1073 | 6.62 x 10 A-15km +2CE 6.35 x 10-4 | 3.71 x 1073 
2L-15km-45? | 7.59 x 1073 | 7.47 x 10? 2L-15km-45? +2CE | 6.54 x 1074 | 3.84 x 1078 
2L-20km-45? | 5.90 x 107? | 5.04 x 107? 2L-20km-45? +2CE | 5.79 x 107 | 3.30 x 1073 


Table 34: Standard deviation on the parameters Ho and Qm in ACDM using one year of GW 
observations from BNS alone for different geometries of ET alone (left) and ET in a network with 
2CE (right). ET is always taken with the full HFLF-cryo sensitivity. 


Modified GW propagation. We next consider modified gravitational wave propagation, 
using the parametrization (6.16). The bounds on the corresponding parameters Eg and n 
are shown in Fig. 59 and the corresponding error (at 68% c.l.) are tabulated in Tab. 36. A 
difference from the analysis of the previous section using EM counterparts is that we assume 
fiducial values of Eg 1.1 and n = 2.5. As described in Section 6.4.1, these values are 
consistent with all current observations. We do this because the form of the parameterization 
(6.16) renders the Fisher matrix (6.18) singular for GR value of Eo = 1 and for n = 0. Any 
value of n away from n = 0 is sufficient to make the Fisher matrix well-defined. We choose 
n = 2.5 for reasons outlined in [367]. We see that the hierarchy among the configurations is 
analogous to that found in Table 32, with the 2L 20km with arms at 45? providing the best 
results, followed by 2L 15km with arms at 45? and triangle 15km, that gives quite similar 
results, while the triangular 10km ET gives the largest errors. 


6.4.3 Hubble parameter from high-mass ratio events 


A main limitation for the reconstruction of the luminosity distance dry, is the fact that it is 
highly correlated with the angle & between the binary's orbital angular momentum and the 
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Figure 58: Constraints on the two—w,; plane in a dynamical dark energy model using one year GW 
observations from BNS alone for different ET geometries. The covariance for a standalone ET is 
shown in the left panel while ET in a network with 2CE is shown in the right panel. 


Configuration | Awo | Awa Configuration Awo | Awa 
A-10km 0.348 | 5.79 A-10km +2CE 0.0241 | 0.231 
A-15km 0.217 | 2.98 A-15km +2CE 0.0207 | 0.193 
2L-15km-45? 0.210 | 3.19 2L-15km-45? +2CE | 0.0211 | 0.198 
2L-20km-45? | 0.153 | 2.04 2L-20km-45? --2CE | 0.0189 | 0.169 
Configuration | Awo | Awa Configuration Awo Aw, 
A-10km 0.129 | 1.02 A-10km +2CE 7.07 x 10^? | 0.0405 
A-15km 0.0940 | 0.644 A-15km +2CE 6.08 x 10-? | 0.0343 
2L-15km-45? 0.111 | 0.773 2L-15km-45? +2CE | 6.29 x 10? | 0.0356 
2L-20km-45? | 0.0823 | 0.532 2L-20km-45° +2CE | 5.50 x 10? | 0.0306 


Table 35: Errors on the parameters wo and wa (6896 c.l.) in a dynamical dark energy model using 
one year of GW observations from BNS alone for different geometries of ET alone (left) and ET in a 
network with 2CE (right), all with their HFLF-cryo sensitivity. The top panel shows the constraints 
marginalised over the parameters Hp and Qm whereas, in the bottom panel, the ACDM parameters 
are assumed to be known a priori. 


observer's line-of-sight. Purely based on the signal-to-noise ratio of an event, for large SNR 
we should expect that the luminosity distance dr, of a source is measured to an accuracy 
of Adz /dr, = 1/SNR. However, this limit is never reached since this correlation (as well as, 
to lesser extent, the correlation with sky location) degrades the accuracy, typically leading 
to error in the distance measurement an order of magnitude larger [403] (unless the angle & 
and the sky location of the source can be fixed by the observation of an associated GRB, see 
the discussion on page 96). For instance, when the inclination angle changes from 0 to 7/6 


^^ For precessing binaries ; would be a function of time since the orbital angular momentum precesses about 
the total angular momentum, which includes the companions’ spin angular momenta. For such systems, . 
should be defined at some fixed, but arbitrary, instant of time. 
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Figure 59: Constraints on the Zo-n plane in a modified gravity model with modified tensor pertur- 
bations using one year GW observations from BNS alone for different ET geometries. The covariance 
for a standalone ET is shown in the left panel while ET in a network with 2CE is shown in the right 
panel. The ACDM parameters are assumed to be known a priori. 


Configuration AzEo/=o An Configuration AEo/=o An 

A-10km 1.06 x 1072 | 5.91 x 107? A-10km +2CE 3.71 x 1077 | 5.78 x 107? 
A-15km 7.01 x 1073 | 4.45 x 107? A-15km +2CE 3.07 x 1077 | 4.99 x 107? 
2L-15km-45? | 8.41 x 107? | 4.29 x 107? 2L-15km-45? +2CE | 3.22 x 1074 | 5.21 x 107? 
2L-20km-45? | 5.88 x 107? | 4.37 x 10? 2L-20km-45? +2CE | 2.71 x 10-4 | 4.99 x 1078 


Table 36: Errors (at 68% c.l.) on the parameters £o and n that describe modified GW propagation, 
using one year of GW observations from BNS alone for different geometries of ET alone (left) and 
ET in a network with 2CE (right), all with their HFLF-cryo sensitivity. The ACDM parameters are 
assumed to be known a priori. 


radians, the relative amplitudes of the hy and hy polarizations of the dominant quadrupole 
mode change by no more than 1%, while the overall amplitude reduces by 12%. This change 
in amplitude can be entirely compensated by moving the binary 1296 closer to the observer 
without an appreciable change in the polarization content of the observed signal. 

This scenario changes dramatically for high-mass ratio events from either neutron star- 
black hole or double black hole binaries. Significant energy in the higher-order modes can 
break the distance-inclination angle degeneracy since the polarization modes have a different 
dependence on the inclination angle. In a recent paper [404] it has been shown that high- 
mass ratio events with large SNR would not only break the dz- degeneracy, but a small 
population of such events would also localize the source to a region small enough to contain, 
on average, just one L, galaxy. This helps in obtaining the redshift of the source from the 
galaxy, although the source itself may not have an EM afterglow. As already mentioned, these 
events are called dark sirens and can measure the Hubble constant without the need for an 
extra distance calibrator. Some of these events, albeit very rare, could measure the Hubble 
constant to an accuracy of better than 296 with a single event with no known systematics 
and are called golden Ho events. This method could be one of the most precise and accurate 
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Figure 60: The accuracy with which the Hubble constant can be measured by different detector 
networks by high-mass ratio binary black holes, with events corresponding to 1 and 2 yrs of observation 
time picked randomly from the 10-year catalog of BBH events located within z = 0.1. In the left plot, 
Ho is measured with no prior imposed on the dark matter energy density Qm, while for the right 
plot we assume a gaussian prior with a width of 0.017. The markers show the median value of the 
fractional error in Ho and the error bars denote the 68% confidence region. 


way of measuring the Hubble constant. 

Since BBHs are not likely to be associated with an EM counterpart, the only way to 
identify the true host is to localize the source to a single galaxy. For NSBH events, however, 
we expect EM afterglows if the mass ratio is not too small (say, greater than about 1:4). While 
asymmetric masses in a NSBH binary could help break the distance-inclination degeneracy, 
the host galaxy could be identified from the kilonova. Additionally, for NSBH binaries with 
large mass asymmetry, it is also possible to identify the host within the sky localization from 
gravitational wave observations. In this sense, NSBH binaries could be used as both bright 
and dark sirens or gray sirens. 

For BBH systems, we restrict our population to events within the redshift z = 0.1, or 
476 Mpc. This leaves us with 86 BBH mergers in a span of 10 years. Among these, we 
choose those events for which the sky area is constrained to better than 0.04 deg”, as we 
expect only one L, galaxy to lie in this sky patch up to a redshift of 0.1 [405]. For the 
events that qualify, we obtain the measurement errors in luminosity distance by calculating 
the Fisher information matrix. Assuming the ACDM cosmology, the errors in dr, can be 
converted to measurement uncertainty in Ho after accounting for the measurement errors in 
the matter density, Qm. We present the bounds on Ho for two cases: without taking a prior 
on Qm, and with a gaussian prior on Qm. The width of the gaussian prior is fixed to the 
sum of the squares of the bounds placed on Qu by Planck [378] and SHOES [406], which is 
equal to 0.017. Using binary black holes as dark sirens, the accuracy with which Hp would 
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Figure 61: Same as Fig. 60 except the results are obtained for the NSBH population instead of the 
BBH population. Moreover, the results were obtained by combining measurement accuracies from 
systems that produce kilonovae in addition to those that identify a unique host within the localization 
error box. 


be measured by different detector configurations in a network of ET and two CE detectors, 
one of which is a 40 km arm length detector and the other 20 km, is shown in figure 60. 

For NSBH systems, the Hubble constant can be measured with gravitational waves 
both without (dark sirens) and with (bright sirens) electromagnetic counterpart. From the 
10 year population, we choose events that qualify as dark or bright sirens (and make sure to 
not double-count events that qualify as both). The NSBH events that qualify as dark sirens 
are chosen in the same way as was done for BBH systems. For bright sirens, we only consider 
events that lie within a redshift of z — 0.5, as we do not expect to detect the kilonovae from 
events that lie farther. In addition, only those events are selected for which the sky position 
is measured better than 9.6 deg?, i.e. the field of view of the Rubin observatory. For these 
events, we generate the kilonova light curves with DD2 as the neutron star equation of state 
using the numerical recipes from [407, 408]. We claim that a particular kilonova is detected 
with the Rubin observatory by using the same criteria that were used in section 4.3, i.e. the 
luminosity of the kilonova should be brighter than the limiting magnitude (corresponding 
to 600s of exposure) of the g and the i filters on two consecutive nights. We also assume a 
5096 duty cycle for the Rubin observatory. The events for which the corresponding kilonova 
is detected qualify as bright sirens and are considered for the measurement of the Hubble 
constant. The luminosity distance errors for both dark and bright siren events are combined 
and the measurement errors for f) are accounted for in the same way as was done for BBH 
dark siren events. The accuracy in measurement of Ho using this gray siren approach with 
NSBH systems is shown in figure 61. 

Among the events that contribute to the measurement of the Hubble constant, there are 
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Configuration BAN ‘eeu 
With 2CE Without 2CE | With 2CE Without 2CE 
A 10 km 68 5 21 0 
A 15 km 74 7 30 0 
2L 15 km 45° 73 4 25 1 
2L 15 km 0° 69 0 21 0 
2L 20 km 45° 77 8 37 1 
2L 20 km 0° 70 0 22 0 


Table 37: The number of dark siren BBH golden events and gray siren NSBH golden events for 
different ET detector configurations in an observation time of 10 years. 


certain golden events for which the accuracy in the measurement of the luminosity distance 
is better than 2%. Such events can individually measure Ho with a precision better than 
2% and resolve the Hubble tension. The number of such golden binaries is given in table 
37, both for the dark siren case with BBHs, and for the gray siren case with NSBH systems, 
and an observation span of 10 years. We see that, when in a network with 2CE detectors, 
the performances of all the ET geometries considered are comparable in the measurement of 
the Hubble constant using the dark and gray siren approaches. In contrast, for ET alone, the 
L-shaped configurations with parallel arms perform significantly worse compared to the other 
configurations, as they are unable to detect a single golden event in an observing time of 10 
years. 


6.5 Cosmological stochastic backgrounds 


In this section we consider the performances of the various ET geometries for the detec- 
tion of specific examples of stochastic backgrounds of cosmological origin. In Section 6.5.1 
we compare the ET sensitivities with the predictions from models of cosmic strings, while 
in Section 6.5.2 we consider the stochastic background produced by a phase transition in 
the early Universe. Finally, in Section 6.5.3, we study the prospects for separating these 
cosmological backgrounds from the astrophysical stochastic background. 


6.5.1 Cosmic Strings 


Phase transitions followed by spontaneously broken symmetries may lead to the production 
of topological defects as relics of the previous more symmetric phase of the Universe. Within 
Grand Unified Theories, it has been shown [409] that cosmic strings (line-like defects) are 
generically formed. At the energy scales relevant to cosmology, the width of a string is negli- 
gible relative to its characteristic size. Such strings (Nambu-Goto strings) are parameterised 
by a single dimensionless quantity, the string tension Gy related to the energy scale 7 at 
which cosmic strings are formed through Gu ~ 10-9[n/(1019GeV)]? (we set the speed of 
light c = 1). 

Cosmic strings emit bursts of beamed gravitational radiation. The main sources of 
bursts are kinks, cusps or kink-kink collisions. Kinks are discontinuities in the tangent vector 
of the string that propagate at the speed of light and appear in pairs. Cusps are points on the 
string that briefly travel at the speed of light. The incoherent superposition of these bursts 
through the history of the Universe produces a strongly non-Gaussian stochastic gravitational 
wave background [410]. Occasionally there may also be sharp and high-amplitude bursts of 
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GWs above this stochastic background. The production of gravitational waves by cosmic 
strings offers a tool to probe particle physics beyond the Standard Model at energy scales 
much above the ones reached by accelerators. A non-detection of a stochastic background 
of gravitational waves imposes bounds on the cosmic string tension and therefore on particle 
physics models beyond the Standard Model. ET will be able to considerably improve on 
current 2G bounds [411]. 

In Figs. 62 and 63 we show the prediction for Ocw(/f) for two different models of 
the loop distribution of cosmic strings, denoted as models A [412] and model B [413], and 
different values of the string tension Gy. In model A the loop production function can 
be determined from Nambu-Goto simulations of cosmic string networks. In model B the 
distribution of non-self-intersecting scaling loops is the extracted quantity from different 
simulations. Within the latter model, loops are formed at all sizes following a power-law while 
the scaling loop distribution is cut-off on small scales by the gravitational back-reaction. In 
model B are produced many more tiny loops than in the model A. These two models for 
the loop distributions have been used by the LVK collaboration to put constraints on cosmic 
strings from O1, O2 and O3 data [411]. 

The various curves define the power-law integrated sensitivity for each ET configuration, 
assuming SNR=3 and Typ, = 10 yr. As already remarked in Section 5.1, the triangular con- 
figurations perform better at high frequency, because the overlap reduction function remains 
constant at high frequencies for co-located detectors, while it falls to zero for widely separated 
detectors. The 2L shape configuration does better around a frequency of 10 Hz, but only for 
co-aligned detectors (this maximises the coherence for the stochastic search). The case of 45 
deg 2L configurations is universally the worst. We conclude that, for the considered loop 
distributions (models A and B), the best stochastic constraints on the string tension are 
~ 10719, corresponding to an energy scale between 10° GeV and 3 x 10!° GeV, 6 to 8 orders 
of magnitude above the electroweak scale. 


6.5.2 First-order phase transition 


As the Universe cooled down, it may have undergone a first-order phase transition (FOPT) 
with bubbles of true vacuum forming, expanding and colliding [414-416]. This in turn leads 
to gravitational waves generated from the collision of bubbles, and from the sound waves 
and turbulence in the surrounding plasma (see e.g. [85, 417] for reviews). The frequency of 
gravitational radiation emitted depends on the temperature. For a temperature of the order 
(10° — 1019) GeV, energy scales inaccessible to particle colliders, we find a SGWB that peaks 
in the ET frequency range. Here we briefly discuss ET detection prospects of supercooled 
FOPTs, a special class of FOPTs which is expected to create the “loudest” gravitational 
waves [418, 419]. 

The gravitational wave spectrum from a supercooled FOPT depends only on two pa- 
rameters, namely the reheating temperature Try and inverse duration 6/Hpy, where we 
denote Hry = H (Tru). Note that the strength of transition a that affects the SGWB for 
the more general FOPTs, does not impact the supercooled spectrum. In Fig. 64 we simulate 
a SGWB containing the residual from unresolved CBCs and an FOPT signal for a range of 
Tru and 8/Hmgg values. The left panel includes a FOPT signal that is dominated by bubble 
collisions, while the right panel is for a FOPT signal dominated by sound waves. Turbulence 
is expected to be subdominant to both bubble collisions and sound waves, so we exclude it 


^'For simplicity, we assume wall velocity to be equal to the speed of light. 
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Figure 62: Energy density of gravitational waves emitted from cosmic string loops with loop 
distribution given by model A [412]. The various curves define the power-law integrated sensitivity 
for each ET configuration, assuming SNR—3 and Tops = 10 yr. 
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Figure 63: Energy density of gravitational waves emitted from cosmic string loops with loop 
distribution given by model B [413]. The various curves define the power-law integrated sensitivity 
for each ET configuration, assuming SNR—3 and Typ; = 10 yr. 
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Figure 64: Detectability of bubble collisions (left) and sound wave (right) dominated supercooled 
FOPT with next-generation ET detector. The dashed line is the detection threshold using 10 km 
triangular ET configuration, the dotted line is 15 km triangular configuration, while the solid and 
dot-dashed contours are for two L-shaped detectors with 15 km and 20 km arms, respectively. 


from our study. It is clear that we will be able to exclude a large part of the parameter space 
with the next generation of GW detectors. 


6.5.3 Source separation 


'The unresolved astrophysical mergers are expected to be the dominant contribution to the 
background observed by ET. It is therefore important to subtract the individual sources, as 
discussed in the previous section, in the hope to reveal the cosmological background. Note 
that the imperfect subtraction of CBC sources may pose a serious threat to digging out a 
cosmological background, and extensive research must be done to improve the subtraction 
methods [420, 421]. Here we take as an example two cosmological sources: cosmic strings 
and an early-Universe first-order phase transition. 


By limiting the study to the frequency range f € [10, 100] Hz, we can assume the CBC 
background to follow f2/3 and the cosmic strings background to be frequency-independent, 
ie. Qcg = const. The first-order phase transition is modeled as a broken power-law (with 
Q, amplitude and f, break frequency), with the assumption that the sound-wave component 
is the dominant contribution to the cosmological SGWB. We simulate astrophysical and 
cosmological SGWB and perform a model selection study of the signal. For details on the 
parameter estimation Bayesian tools, see |422]. We compute Bayes factors for a CBC signal 
to noise, and for a combination of cosmological and CBC signal to noise. Comparing the two 
Bayes factors demonstrates if one model is preferred over the other, i.e. if we can detect the 
presence of a cosmological signal. From Fig. 65 we conclude that a cosmic strings background 
of amplitude 2.2 x 1071} and a first-order phase transition background of amplitude up to 
4.0 x 10-1? (at 25 Hz) will be probed with this 3G detector. 


Note that the various ET configuration makes little quantitative difference to the re- 
sults, but including next-generation Cosmic Explorer can lead to great improvement in SGWB 
source separation. 
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Figure 65: Upper limits we can place on the amplitude of the background from cosmic strings 
(left panel) and a first-order phase transition (right panel) with ET’s triangular 10 km configuration 
(dashed line) and 2L 20 km configuration (solid line)J. We assume a strong preference for a signal 
that contains an astrophysical and a cosmological contribution, relative to a signal containing only 
unresolved astrophysical objects for log Bayes factors greater than 8. 


6.6 Continuous waves 


Broadly speaking, the class of continuous waves (CW) comprises all deterministic’? GW 
signals that last much longer than typical transient signals, whose durations vary from a 
fraction of a second to tens or hundreds seconds at the most. No CW has been detected 
so far, although interesting upper limits have been established, see e.g. [423-425] for recent 
reviews. The prototypical source of CWs is a spinning neutron star, asymmetric with respect 
to the rotation axis, that emits a quasi-monochromatic gravitational wave whose frequency 
changes extremely slowly over time. For such sources, the expected signal duration is much 
longer than one day, meaning that the frequency modulation due to the Doppler effect, caused 
by the motion of the Earth, and the phase and amplitude modulations due to the detector 
radiation pattern functions, play relevant roles. 

Actually, other sources of CW-like signals have been investigated. In particular, tran- 
sient CW signals (tCW) have been considered, such as those expected to be emitted by a 
newborn magnetar, in the early stages of its evolution [426-429]. In this case, the frequency 
secular variation is caused by a combination of the electromagnetic emission, due to the 
super-strong magnetic field, and the gravitational-wave emission, due to the strong asym- 
metry induced by the electromagnetic field itself. As a consequence, the resulting source 
spin-down is orders of magnitudes faster than for standard neutron stars, producing a signal 
that stays in the sensitivity band of detectors for hours or days. 

Recently, several theoretical works have shown that GW detectors can also be used to 
probe dark matter, see e.g. |430] for a comprehensive review. In many of the proposed mech- 
anisms, the resulting signal has a CW-like nature. This is, for instance, the case for depleting 
scalar or vector ultra-light boson clouds that could form around Kerr black holes through su- 
perradiance [431], the interaction of dark matter particles with detector components [432], ? 
or the early inspiral of sub-solar mass primordial black holes [433]. 

The typical duration of CW signals has some important consequences. First, once 


?^In contrast to GW backgrounds, which are described by means of stochastic processes. 
^?In this case the effect of the interaction is not a true GW signal, but is such to induce a relative motion 
of the optical components which can be searched as if it was a standard gravitational wave. 
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detected, a CW source can be monitored for long periods of time, becoming a true labo- 
ratory for fundamental physics and astrophysics, and potentially allowing observations of 
tiny deviations from the assumed models (for instance, a deviation from GR, in the form of 
non-standard polarizations [434]). At the same time, signal persistence allows us to reduce 
the false alarm probability of any candidate virtually to zero, because if a signal is present 
in a given dataset — even with low significance — it will also appear in new datasets, and its 
significance will increase until a detection can be claimed (of course, this is not necessarily 
true for tCWs). Moreover, CW source parameters can be measured with very high accuracy, 
as their uncertainties decrease with the observation time. On the other hand, CW signals are 
predicted to have small amplitudes, compared to e.g. typical CBC signals, and to search for 
them can require a huge computational cost. For this reason, the development of increasingly 
sensitive, robust, and computationally efficient algorithms is a very active field that grows 
in parallel to current detector upgrades and R&D for future detectors, such as ET. In the 
following, we give more details on different sources of CW-like signals, focusing attention on 
the impact that different ET configurations have on the chance of detection. 


6.6.1 CWs from spinning neutron stars 


Spinning neutron stars (NS), isolated or in a binary system, asymmetric with respect to the 
rotation axis, emit a persistent gravitational wave signal with frequency in a given ratio to 
the rotation frequency of the star. 

'The typical amplitude of CWs signals is difficult to estimate, as it depends on poorly 
known quantities, especially the star's ellipticity, which is a measure of its degree of asymme- 
try. In the case of a CW emitted by a non-accreting, spinning NS, assuming it is asymmetric 
with respect to its rotation axis, the signal amplitude is given by 


An^ G eL, f? 26 € I f 2/1 kpc 
= ~ 1.06 x 107 (=) (6.21 
c! d PAA 10-6) * e kg =) (x Hz d (620) 


where d is the source distance, f is the GW frequency (equal to two times the star’s spin 
frequency in the prototypical case of a NS rotating around one of its principal axes of inertia), 
€ = (kx — Iyy)/ Iz; is the ellipticity of the star, and 17; is the moment of inertia of the star 
with respect to the principal axis aligned with the rotation axis. 

The search for CWs from spinning NSs depends on the amount of information we have 


ho 


or we assume to have on the sources. For targeted searches, we use accurate measurements 
of the main NS parameters, namely the position, spin frequency and its derivative(s), and 
Keplerian parameters for NSs in binary systems, obtained from electromagnetic (EM) ob- 
servations (typically radio, gamma, or X-rays) to perform very sensitive searches based on 
matched filtering. Data from the detector network can be coherently combined in order to 
increase the sensitivity; see [435] for the most recent O3 results obtained by the LIGO and 
Virgo Collaborations for targeted searches. On the other hand, there are all-sky searches 
for NSs without an EM counterpart, in which a portion of the parameter space as large 
as possible is explored. Due to computational load constraints, such searches are based on 
hierarchical semi-coherent methods which sacrifice some sensitivity in order to make them 
computationally feasible. See [436] for the latest LVK all-sky search results. Searches cover- 
ing “intermediate” cases are also routinely carried out. For instance, narrow-band searches 
slightly extend the parameter space around the EM-inferred parameters of known pulsars, 
in order to account for possible small — fraction of a Hz — mismatches between the EM and 
GW emission |437]. Directed searches assume the source position is known, while the star's 
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Table 38: Expected number of detectable sources, assuming three different conditions for the 
ellipticity: € = esq (ni), e = min(e,q, 1079) (ng), € = min(esa, 107?) (na), assuming a total 
observation time Tops = 1 year and a duty cycle of 8596. For each case, we give in parentheses the 
minimum and median value of ellipticity for detectable signals. 


Configuration nı n2 na 
A 10km 866 (2.5 x 10719, 1.3 x 10-4) | 180 (2.5 x 10729, 4.4 x 1079) | 19 (2.5 x 10719, 7.5 x 10719) 
A 10km HF-only | 398 (2.5 x 107 10, 6.2 x 1079) | 178 (2.5 x 10719, 4.4 x 1079) : (2.5 x 107 10 7.5 x 10719) 
A 15km 983 (2.1 x 107 10. 1.1.x 10-4) | 214 (2.1 x 10719, 4.4 x 1079) | 33 (2.1 x 107 10. 7.9 x 10719) 
2L 15km 959 (2.0 x 107 10. 1.2 x 1074) | 206 (2.0 x 10719, 4.2 x 1079) | 29 (2.0 x 107 10. 8.1 x 10719) 
2L 15km HF-only | 451 (2.0 x 107 1. 5.6 x 107 °) 203 (2.0 x 10719, 4.0 x 107°) | 29 (2.0 x 107 10. 8.1 x 10710) 
2L 20km 1035 (1.8 x 107 10. 1.1 x 1074) | 227 (1.8 x 10710, 4.3 x 1079) | 33 (1.8 x 107 10 7.3 x 10710) 


rotational parameters are unknown, as in the case of the supernova remnants or the galactic 
center [438, 439]. All-sky and directed searches typically use data from different detectors 
(or runs of the same detector) in order to make coincidences among outliers, with the aim of 
suppressing the false alarm probability. Although no detection of CWs has been made so far, 
interesting upper limits on the star's ellipticity have been placed, see references above. These 
can be interpreted in terms of constraints on the residual initial star's deformation frozen 
after crust solidification or as due to an inner magnetic field misaligned with the rotation 
axis, using a given ratio with the external poloidal one. 

Although the range of CW searches for spinning NSs is very wide, in the following we 
provide some figures of merit about detection perspectives by E'T focusing on two relevant 
cases, targeted and all-sky searches, that bracket the span of typical searches in terms of 
dimension of the explored parameters space, sensitivity and computational cost. 

For targeted searches, we provide some summary results in Tab. 38 where, for different 
ET configurations, the number of potentially detectable sources (among those contained in 
the ATNF catalogue ^^) is given, under three different assumptions of the source ellipticity. 
We consider a total observation time Top, = 1 year, a duty cycle of 85% and assume to make 
a fully-coherent search based on matched filtering. In the case of 2L-shaped network, results 
do not depend on the relative orientation of the two detectors. Columns 2 indicates the 
number of detectable pulsars under the assumption that they emit at their spin-down limit, 
i.e. that all their spin-down is due to the emission of GWs. This provides a theoretical upper 
limit to the emitted strain, given by 


hosa = een)" (6.22) 


29 Ff 


where Í is the signal frequency first time derivative. The corresponding limit on the star 
ellipticity is 


h h 1093 kgm? \ /100 Hz V? 
eq = Psd gud ~ 9.46 x 19-6 ( "0s ) y 0% kgm 00 Hz d 
Uu An?G f? 10725 Tez f 1kpc 
(6.23 
'The number of pulsars given in column 3 is computed by assuming that every star's ellipticity 


is the minimum of the spin-down limited value esą and 1079, which is of the order of the 
maximum theoretical ellipticity a NS with a standard EOS could have [440]. In column 4, 


^ nttps://www.atnf.csiro.au/research/pulsar/psrcat/ 
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the number of detectable pulsars is computed assuming the minimum ellipticity between €.q 
and 1079, which is considered as a plausible value for millisecond pulsars [441, 442]. For 
each case, we also report the minimum and median value of the ellipticity for the potentially 
detectable sources. 

From this table, we can draw some considerations. The main factor affecting the number 
of detectable sources is the arm length, rather then the network geometry. Indeed, by coher- 
ently combining individual detector data streams, the triangle configuration would provide 
a sensitivity gain of 3/2, with respect to the 2L network, which is nearly fully balanced 
by the shape factor, so that at the end what mostly contributes to the sensitivity is the 
arm length. We also note that, for the more realistic choices on star’s ellipticities, corre- 
sponding to columns ng and n3, most of the potentially detectable sources are millisecond or 
young fast pulsars, spinning at frequencies larger than a few tens of Hertz, for which the low 
frequency sensitivity of the detectors is not particularly relevant. The corresponding typical 
values of the ellipticity are of about 2 x 10~!°, corresponding to “mountains” extending from 
the star’s surface not higher than about lum. Lastly, focusing on the most conservative 
condition, given by column n3, we find that the number of potential detections increases by 
50-70% going from 10km-arm detectors to 15 or 20km-arm detectors. 

Wide-parameter searches are based on semi-coherent methods in which data segments 
of given duration are incoherently combined. In principle, they are less sensitive than tar- 
geted ones, but can cover a huge parameter space and are intrinsically more robust against 
non-predicted deviations from the assumed signal model. Concerning, in particular, all-sky 
searches, Fig. 66 shows, for different detector configurations, the minimum detectable ellip- 
ticity as a function of the signal frequency, assuming source rotational evolution is dominated 
by the emission of GWs and a fixed source distance of 8 kpc. These results have been obtained 
considering an analysis done by means of the FrequencyHough pipeline [443], a semi-coherent 
procedure routinely used in current CW searches, whose sensitivity can be estimated (under 
the assumption the noise is Gaussian) as 


497 |S,(f) 
Amin,95% ~ Ni/4 jm V CRihr + 1.6449, (6.24) 


where N = Tobs/TFFr is the ratio between the total observation time and the duration of 
the data segments to be incoherently combined, S,(f) is the detector average noise power 
spectrum and CR; is the threshold Critical Ratio to select outliers. In the plot, we used 
T bs=1 year, Tprp=10 days and CR, = 3.4 for triangular configurations and CRip, = 4 for 
L-shape configurations. The presence of more that one detector is exploited in such kind of 
searches to strongly reduce the false alarm probability. For the standard search parameters 
considered here, the number of points in the parameter space (sky position, frequency and 
frequency derivatives) is of the order of 10??. Assuming that for each detector of the order 
of 10! outliers are selected, ^^ it is possible to show that double coincidences are enough 
to reduce the number of candidates to O(10), and that triple coincidences would allow - in 
principle - to shrink this number to a negligible level, assuming ideal Gaussian noise [443]. 
As a matter of fact, in practice we expect a significant number of candidates to survive the 
coincidence step, due to non-Gaussian instrumental artefacts which, inevitably, pollute the 
data. Having triple coincidences, as for the triangle configuration, allows to better reduce 


^ Currently, the typical number of selected outliers is o(10'°), but we devise an increase of two orders of 
magnitude as reasonable in the ET era. 
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Figure 66: ET sensitivity reach to all-sky searches of CWs (95% C.L.). The various continuous 
curves are the minimum detectable ellipticity as a function of frequency for different detector config- 
urations, assuming a source distance of 8 kpc, an observation time Tops=1 yr, a duty cycle D=85% 
and data segments duration Tppr=10 days. The dashed curve indicates a constant spin-down of 
10-5 Hz/s. Right plot is a zoom of the left plot in the high frequency region. On the scale of the 
right plot, the curves corresponding to A 10 km HF and A 10 km coincides, and similarly for those 
corresponding to the 2L 15 km HF and 2L 15 km configurations. 


false candidates with respect to the 2L configuration. This implies that in the former case 
we can use a slightly lower threshold for outliers selection, with the aim of having a similar 
number of candidates after the coincidences. A lower threshold, according to eq. (6.24), 
corresponds to a better sensitivity. Of course the exact value of the threshold would depend 
on the non-Gaussian, non-stationary features of the actual dataset, which are difficult to 
predict in advance. As a rule of thumb, we decide to reduce the threshold on the CR from 4 
to 3.4 in the case of the triangular configuration. This choice would correspond to a reduction 
of one order of magnitude in the false alarm probability in presence of Gaussian noise. 

As for targeted search, the relative orientation of the two triangular detector in the 
2L-shaped configuration does not impact on the analysis outcome. A clear feature of these 
results is that different configurations do not produce very different results, especially in the 
high frequency range, which corresponds to smaller ellipticities. In general, however, longer- 
arm detectors perform better than shorter ones, with differences up to a few tens percent. 
For instance, a source located at 8 kpc, and emitting a signal at about 500 Hz, would be 
detected if its ellipticity is at least ~ (4 — 5) x 1075. At a given frequency, the ellipticity 
scales linearly with the distance. Nearby sources, located within ~ 1 kpc, would produce 
detectable signals with an ellipticity as small as few times per 10-?, if spinning at around 
500 Hz. Although current searches typically use a shorter data segment duration, values as 
large as 10 days are reasonable, and likely conservative, for the ET era, given the algorithmic 
developments and the increase in available computing power. It is interesting to stress, 
however, that while the sensitivity of semi-coherent searches scales as the fourth root of the 
segment duration, the dimension of the parameter space scales at least as the duration to 
the fifth power. This means, for instance, that using segments of 30 days, instead of 10, 
would provide a theoretical sensitivity gain of about 30%, in front of a computing cost at 
least 3° times bigger. Moreover, and probably even more importantly, the use of very long 
segment durations reduces the search robustness with respect to unmodeled signal features, 
which can translate into a sensitivity loss. As an example, a source may have a significant 
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intrinsic transversal velocity with respect to the line of sight. If not taken into account, it may 
determine a sub-optimal Doppler correction as a consequence of the change of sky position 
during the observation time. Another possibility is that a signal may be characterized by some 
level of frequency wandering due to some unpredicted mechanism, involving the interaction 
with other objects, or due to intrinsic processes in the source (see e.g. |444, 445] for analysis 
methods that tackle this issue). It is then important to note that a reasoned balancing among 
theoretical sensitivity, available computing power and algorithm robustness must be always 
taken into account when new algorithms for wide-parameter searches are developed. 


6.6.2 "Transient CWs 


Newly-born neutron stars could be extremely deformed and emit very strong GWs immedi- 
ately after they have formed from a binary neutron star merger or supernova explosion. See 
[446, 447] and references therein for recent observational evidences. 

Such systems are astrophysically very interesting: detecting a remnant of a neutron star 
merger would help to improve constraints on the equation of state [448-451] compared to 
only considering the inspiral of the system alone [231, 452, 453] in the case of binary neutron 
star mergers or only electromagnetic observations of the supernova. Furthermore, detecting 
such a system would be a huge step in multi-messenger astronomy, and would allow us to 
determine the nature of the remnant, something at which electromagnetic observations can 
only hint. As we saw from GW170817, astronomers were able to see the remnant in multiple 
ways —UV, IR, visible — and each way seemed to imply a different expected remnant. Some 
studies report the formation of a hypermassive neutron star lasting for O(s) [454-458], while 
others saw continued energy emission from a long-lived remnant |459—462]. In the latter 
case, the signal emitted would last longer than the canonical O(s) expected from mergers, 
but shorter than the durations O(years) expected from much older, asymmetrically rotating 
neutron stars. This "intermediate" regime of signal durations would contain very interesting 
physics, and yet until recently, not much effort had been made to search for these kinds of 
sources. 

Such searches for a remnant of GW170817 were performed on O2 data, analyzing data 
in segments of O(seconds) for burst-like signals and O(hours-days) for tCWs, and set the 
first-ever upper limits on GW emission from a remnant [463, 464]. In contrast to ordinary 
CWs, tCWs last for a much shorter time, have very large spin-downs, and have frequencies 
that evolve non-linearly over time. Thus, new analysis methods had to be designed to search 
for these signals [465—470]. The sensitivity of the O2 search allowed us to reach distances O(1) 
Mpc; however, with ET, we will be able to reach distances of O(10 — 100) Mpc, depending 
on the degree of deformation of the neutron star. To compute this sensitivity, we use an 
equation similar to eq. (6.24), that has been generalized for power-law signals and gives the 
maximum distance reach at a given confidence level: 


1/2 


dmax 7 18107 m (c 9 ,) (Ea) c EE (37 SI] (Race 169). 


(6.25) 

Here, F; = f? is short-hand for the ith frequency, and the sum goes over all frequencies 
during Tops. 

In the left-hand panel of Fig. 67, we plot the expected distance reach for different ET 

configurations, for a braking index n = 5, i.e. pure GW emission. The ellipticity at each 

frequency is the maximum allowed by energy conservation, and we restrict to ellipticities less 
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Figure 67: Left: ET distance reach to transient CW signals emitted by remnants of binary neutron 
star mergers or supernovae for various arm lengths and shapes, fixing the configuration to allow 
both high and low-frequency enhancements. The ellipticites used to construct these curves are the 
maximum allowed by energy conservation, assuming that all rotational energy is emitted in GWs, 
and imposing e < 5 x 1073, the largest expected degree of deformation. At frequencies for which the 
ellipticity given by energy conservation exceeds 5 x 10°, we impose e = 5x 1073. Right: ET minimum 
detectable ellipticity for different configurations for newborn neutron stars 6 Mpc away. Both plots 
use fọ < —1 x 10-4 Hz/s, Terr ~ 70 s, Tops = 1 day, and I,, = 1039 kg. m?. 


than 5 x 1073, the largest expected for newborn neutron stars [471]. However, at frequencies 
below ~ 1200 Hz, the ellipticity given by energy conservation is much larger than that 
allowed theoretically; hence, we impose e = 5 x 107? up to 1200 Hz. In this regime, we see 
that the distance reach steadily increases with frequency at a fixed e = 5 x 107%, which is 
consistent with eq. (6.25). For frequencies greater than 1200 Hz, the distance reach decreases 
slightly, since at this point, the ellipticity allowed by energy conservation is in fact less than 
5 x 1073. Magnetars formed in core collapse supernovae look more promising, in terms of 
event rate, with respect to those following the merger of NS binary systems. The overall 
core collpase supernova rate is of the order of one event per year within 10 Mpc (4 events/yr 
within 20 Mpc) [472]. The fraction of core collapses leading to the formation of a magnetar 
is uncertain and, while past estimations pointed to the (1 — 10)% range [473], more recent 
results, based on observations and a more proper consideration of selection effects, typically 
provide higher values, even of the order of 40%, see e.g. [474]. This would correspond to 
a magnetar formation rate of — 2 per year within 20 Mpc, which is well within the search 
distance reach for a wide range of initial spin frequencies, for the optimistic deformation rate 
discussed above. 

In the right-hand panel of Fig. 67 we plot the minimum detectable ellipticity at a fixed 
distance of 6 Mpc, at which we expect to detect magnetars forming at a rate of one per 
5-10 years. At such a distance, the only low-frequency (< 20 Hz) sources that could produce 
a detectable gravitational-wave signal are those with unphysical ellipticities, i.e. € > 1, so 
we omit those on the plot. We only consider frequencies at which the minimum detectable 
ellipticity is less than 5 x 1073, which is consistent with the left-hand panel. In fact, at such 
distance, we would be able to detect the tCW from newborn magnetars with ellipticity of 
the order 107? — 1074, much lower than the maximum predicted value. 

We see that the differences among configurations are mostly driven by the arm-length, 
and there is little difference between the 2L-15km and 15km triangle. As in the CW case, we 
have applied CR, = 4 for the 2L case, and CR, = 3.4 in the triangle case to capture the 
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anticipated improvements in both computational power in the 3G era and improved coinci- 
dences resulting from a triangle-shaped detector. Furthermore, these plots are created with 
the full HFLF xylophone configuration. If we were to remove the low-frequency instrument 
in the left-hand panel, the distance at low frequencies < 100 Hz (not shown) decreases from 
O(1-— 10) kpc to O(0.1 — 1) kpc. This could be relevant, since the enhancements due to 
the LF instrument allow us to reach the galactic center at low frequencies. However, in the 
right-hand panel, the results do not vary at all, since the ellipticities are already unphysical 
at frequencies below 30 Hz (not shown). 


6.6.3 Search for dark matter with CWs 


GW detectors could probe the existence of some kinds of dark matter (DM) [475]. In many 
cases, DM has been predicted to produce CW-like signals, so that data analysis methods used 
for the search of “standard” CW or tCW have been extended, see e.g. [476], and applied 
to the search for DM signatures. Ultralight bosons are interesting DM candidates, including 
dark photons and Quantum ChromoDynamics axions [431, 477—479]. Potential detectable 
systems might be boson clouds formed around spinning BHs or vector bosons in form of 
dark photons. Detectable continuous signals can arise also from compact dark objects or 
primordial black holes. 


Boson clouds around black holes  Ultra-light bosons fields present in the nearby regions 
of a Kerr black hole can clump around it through a superradiant mechanism, forming a 
BH-boson “cloud” system at the expense of the BH angular momentum [431, 479, 480]. 
'This formation channel is maximally efficient when the particles's Compton wavelength is 
comparable to the size of the BH, h/(myc) ~ GMpy/c?, where m; is the boson mass, Mpy 
is the black hole mass, h is the reduced Planck's constant, and c is the speed of light. Once 
formed, the cloud starts to dissipate via particle annihilation to gravitons, emitting a quasi- 
monochromatic and long-duration signal with a frequency dependent mainly on the boson 
mass, 
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Earth-bound detectors are mostly sensitive to boson masses in the 10714 — 1071 eV/c? 
range [445]. The BH-boson cloud system is often referred to as a gravitational atom. The 
emitted signal is characterized by a spin-up due to the cloud mass decrease, which reduces its 
binding energy |481, 482], as well as to boson self-interaction [483]. The GW signal amplitude 
is 


h(t) = ho (1 + d , (6.27) 


where hg is mainly determined by the BH and boson masses, the BH initial spin and the 
distance, through 
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where a is the the fine-structure constant of the gravitational atom, given by 


GMggn myc? 

cà h 
The gravitational-wave timescale Tgy is also dependent on the BH mass, the boson mass and 
the BH initial spin [431, 445, 482]. The detection of CW signals from boson clouds would 


(6.29) 
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allow to establish a fascinating connection among particle physics and black holes. Null 
results, as those obtained so far, see e.g. [445, 484, 485], brings anyway valuable constraints 
on the permitted boson masses. 


Dark-matter particles interacting with GW detectors Another interesting scientific 
scenario for ET are ultralight particles behaving as a classical field interacting coherently 
with the atoms of the test masses. For instance, dark photons coupled to the baryon or 
baryons minus leptons number U(1)p/U(1)p.r have been considered in literature. These 
particles can produce an oscillating force on dark charged objects |432, 486, 487]. The same 
type of behavior is expected for tensor particles [488]. Different production mechanisms have 
been proposed for the dark photon production, such as misalignment mechanism associated 
with the inflationary epoch, light scalar decay or cosmic strings [489, 490]. The dark photon 
DM field oscillations impinge a time-dependent Equivalence Principle-violating force acting 
on the test masses. This will produce a change in the relative length of the detector’s 
arms, causing a signal strain at the detector output. The time-dependent force acting on 
the test masses produces a strain oscillating at the same frequency and phase as the dark 
photon field [432, 491—493]. It should be noted that a similar signal may be produced in the 
case of scalar dark matter particles directly interacting with the mirrors. In particular, as 
described by [494], the interaction of the scalar bosons with the detector beam-splitter will 
induce oscillations in the thickness of the mirror due to the oscillations of the fundamental 
constants [495—497]. 'They could also interact with the particles in the reference cavity in the 
detector, resulting in a similar measurable oscillation of fundamental constants [498]. The 
signal frequency is determined by the dark photon mass, fo = m,c?/(27h), corresponding 
for Earth-based detectors to dark photon masses in the range 10714 — 107! eV/c?. In fact, 
the signal is not exactly monochromatic due to the fact dark photon particles follows a 
Maxwell-Boltzmann velocity distribution. 

Taking into account the physics of dark-matter signals, different CW methods based 
on cross correlation and excess power have been applied to look for these types of signals 
[432, 491, 499, 500], and have resulted in competitive constraints on U(1)g coupling using 
data from the most recent observing run of advanced LIGO/Virgo/KAGRA [492, 501], and 
on scalar dilaton dark-matter coupling to electrons and photons using GEO600 data [494]. 
Furthermore, projected constraints for axions altering the polarization of light shining down 
the interferometer arms have been produced [502, 503]. 

When considering the different configurations and geometries, we note that the 2L 
geometry would experience a larger strain due to dark photons by a factor of 2/4/3 ~ 1.15 
compared to a triangle-shaped detector, independent of the arm-length [432]. Since any 
frequency (or mass) for the dark photon or other dark matter particle is possible, both the 
low- and the high-frequency instruments of the ET xylophone configuration are important. 
Furthermore, increased arm length is also the primary way to improve sensitivity to these 
particles, since they behave just as traditional CWs, though 7Trrr could not be as long as 
10 days because of the frequency modulation induced by the Maxwell-Boltzmann distributed 
velocities. Finally, the triangle-shaped detector would allow triple coincidences, but since 
the dark-photon signal would show up looking like “correlated noise" between the detectors, 
having a single triangle-shaped detector in one place may increase the number of correlated 
noise disturbances, reducing the sensitivity to dark-matter particles. 


Inspiraling compact dark objects and PBHs. Dark matter can be present in our 
universe in form of macroscopic objects, we generically refer to as compact dark objects 
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(CDOs). The actual formation channel and the origin of CDOs is still widely debated. 
CDOs can form pairs and emit an almost monochromatic signal while being far from the 
coalescence phase. Primordial BHs (PBHs), that we have discussed in detail in Sections 6.3.2 
and 6.3.3, are an example of CDOs. In the context of CW signals, binary systems made of 
sub-solar mass PBHs as well as more generic CDOs, represent potential targets. Indeed, the 
GW signals emitted by these systems, when the two compact objects are far away from the 
coalescence and their masses are small enough, say < 107? Mo, i.e below the mass range 
considered in Section 6.3.3, can be modeled as CW or tCW signals with a spin-up described 
by a power law, see e.g. [433, 504]. 

If the chirp mass is small enough, the frequency can be modeled exactly as the linear 
frequency Taylor expansion used for standard CW searches. This means that, for instance, 
a pair of inspiralling PBHs with chirp masses below 107? Mo would emit a GW signal 
indistinguishable from those arising from non-axisymmetric rotating NSs spinning up, and 
upper limits have already been placed on the fraction of dark matter that PBHs could 
compose (fppyH) using CW results from the O3a and O3 observing runs [436, 505]. 

These upper limits on fppy derived from CW searches are of particular importance, 
since they lie within the “asteroid-mass” PBH mass regime. In this mass range, there are 
almost no existing constraints on fppy [258]. ET will allow us to probe realistic fppy in the 
asteroid-mass regime (currently upper limits on fppy >1 from these searches), and will result 
in an improvement of 1-2 orders of magnitude, regardless of the detector configuration or 
geometry [433]. The same comparison of different detector designs discussed in Section 6.6.1 
for all-sky searches would be applicable here, since the constraints on fppy are derived from 
all-sky search results. 


6.6.4 Conclusions 


We have considered here the impact of different ET geometries and configurations on the 
detectability of persistent GW signals. In the case of CWs from slowly spinning down asym- 
metricaly rotating neutron stars, we find that the number of detectable sources in targeted 
searches is mostly sensitive to the arm length while, for realistic degrees of deformation to 
high-frequency enchancements, the shape of the detector does not have a relevant role. Fur- 
thermore, in all-sky searches, we expect to be able to use a value of Tpprp that is ~ 100 
times longer than used in analyses today, and thus find that we can constrain the ellipticity 
of unknown neutron stars to be less than 1 x 107” at frequencies above 300 Hz from the 
galactic center. This sensitivity is also mostly a function of the arm-length, and does not 
vary significantly between design choice. 

For transient CWs, the arm length is also the most important factor that determines 
the relativity sensitivity of the different configurations considered, with the low-frequency 
sensitivity only helping to reach the galactic center at signal frequencies below ~ 30 Hz. 
However, we considered sources with initial spin-downs one to three orders of magnitude 
smaller than those searched for in [464], determined by the maximum allowed ellipticity 
predicted theoretically. This limited our distance reach by a couple of orders of magnitude 
compared to the ellipticities 107? — 107! considered in [464]. 

We have hinted at the possibility of constraining exotic physics with CW and tCW 
searches, and note that the results presented here can be recast in terms of constraints on, 
e.g. the fraction of dark matter that PBHs could compose, the maximum coupling strength 
to baryons that dark-matter particle could have, or boson/black hole mass pairs that could 
be excluded from existing. 
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All these results must be interpreted carefully, since they assume Gaussian noise and a 
lack of detector artifacts. In practice, glitches and noise lines will affect the sensitivity to all 
of these sources. Noise lines will inhibit dark-matter and CW searches the most, since they 
focus on persistent, long-lived GWs, while glitches are more problematic for the transient 
CW searches. We have tried to capture the impact of improved coincidences by lowering 
CR for the triangle-shaped detector compared to the 2L one; however, these thresholds 
are subject to change in practice based on available computing power and the characteristics 
of the noise. We do not consider explicitly how the so-called “null stream" would impact the 
detectability of these sources, though we expect that our methods, already designed to be 
robust against noise disturbances and glitches, can achieve comparable sensitivities as those 
obtained in Gaussian noise, based on current GW searches. 


7 The role of the null stream in the triangle-2L comparison 


The null stream is a signal-free linear combination of the interferometer strain data, and it 
plays an important role in the comparison of 2L and triangle (A) configurations of ET. A null 
stream can be formed for an arbitrary GW detector network with at least three detectors, 
but generally only for one GW signal at a time [506]. However, the A configuration allows 
one to form a null stream that cancels out all gravitational-wave signals simultaneously [117], 
because of its closed geometry of component arms and almost negligible light travel times be- 
tween components. The null stream in the A configuration (from now on just “null stream” ) 
has the unique ability to provide access to a signal-free channel, and therefore access to the 
properties of (incoherent) instrumental noise without contamination from gravitational-wave 
signals [146]. 

It should be stressed that inference from data with the null stream noise discarded is 
identical to coherent inference from the full set of data so that access to the null stream 
does not directly improve inference [507]. Moreover, a 2L configuration without access to 
the null stream may still achieve good noise mitigation but requires more complex methods 
that introduce uncertainty into the process. Access to the null stream provides a more 
straightforward way (compared to the 2L configuration) to extract the power-spectral density 
(PSD) of the detector noise and enables unique data-analysis techniques for improved science 
extraction. In the following, we will describe some of these techniques and their impact on 
science extraction. 


Estimation of unbiased noise power spectrum: Within the A configuration of ET, 
the null stream dnu can be straightforwardly formed by 


dmu = di + d2 + d3, (7.1) 


where d; are the strain data for the individual detectors in the triangle. If the noise properties 
are homogeneous and incoherent among these detectors, then the noise PSD of each detector 
S» can be estimated by 
; 1 

Sh a zon (7.2) 
where St"! is the PSD of the null stream. In the absence of a null stream, it is non-trivial 
to disentangle the detector noise from the unresolvable GW signals, which is also referred to 
as “confusion noise”. 
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Assuming uncorrelated noise, another method to estimate the instrument-noise PSD 
is by calculating the cross power-spectral density (CSD) of the null stream with the data 
streams [146]: 


(dnund;) T^ Shs (7.3) 


where S$ is the PSD of detector i. This method does not require the PSD of different 
interferometers to be the same. 

Noise can be produced identically in two interferometers, which would therefore not 
appear in the null stream and bias PSD estimates that use the null stream. Technically, the 
bias in PSD estimates is produced by the real part of the complex-valued instrument-noise 
CSDs between interferometers. The natural magnetic background is a possible source of such 
noise. Also seismic gravity fluctuations are considered a possible source of identical noise. 
However, over the > 400m distances between any two test masses of two LF interferome- 
ters, this so-called Newtonian noise is most likely weakly correlated [508], which means that 
one should only expect a mild suppression in the null stream. See Section 5.4 for a more 
in-depth discussion of these noise sources. Ref. [147] shows that, given knowledge of the 
correlated/non-identical noise sources (e.g. through witness sensors), unbiased instrument- 
noise PSD estimates can in principle be obtained even in the presence of correlated noise. 

Impact on science extraction: The most direct impact of an unbiased estimate of the 
noise power spectrum is on the measurement of the stochastic gravitational-wave background. 
By subtracting the null stream PSD from the individual detector PSD, one obtains the PSD 
of the GW signals S, present in the data 


T 
Sy = Si — zon (7.4) 


where the equality holds when the noise is identical amongst individual detectors (see e.g. 
Refs. [146] and [147] for discussion on deviations from this assumption). Sp can be directly 
related to the SGWB if the data contains no resolvable signals. Otherwise, one has to first 
subtract the loud sources from the data (see e.g. Refs. [509, 510]). 

The inability to disentangle detector noise from confusion noise also has the effect of 
raising the overall perceived noise level. In a templated GW search, the effect manifests as 
a loss of matched filtering SNR, and it is demonstrated in Fig. 68. Fig. 68 shows the CBC 
horizon redshift (left) and its percentage loss (right) as a function of the total source-frame 
mass and for different merger rates. The loss in the redshift reach of ET increases with a 
higher merger rate of compact binaries, and for BBHs it may appear between z 2.5% and 
x~ 20%, depending also on the mass of the CBC. For the A configuration, the efficiency for 
mitigating this effect approaches 10096 as the sensitivity of all ET components approaches 
the same level. 

Furthermore, the confusion noise impacts a calculation of the false alarm rate (FAR), the 
probability of incorrectly rejecting the noise hypothesis. The FAR is calculated by comparing 
the noise-induced (background) distribution of a detection statistic (e.g. SNR for matched 
filtering) with the one measured from a signal candidate. For example, one could perform 
matched filtering on time shifted data (between the individual detectors) to estimate the 
background distribution (see e.g. Ref. [511]). FAR calculations typically assume that the 
number of genuine GW detectable signals is low enough not to impact the estimate, which 
will not be the case for ET anymore. By estimating the noise background distribution 
directly from the null stream (e.g. time shifting the null stream with respect to itself), one 
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Figure 68: The loss in the ET detection horizon, as estimated in [509], due to the confusion of 
instrumental noise and compact binary inspirals. Dashed lines show the effect of the median local 
merger rate and dotted lines represent the upper merger rate. A higher merger rate introduces a 
stronger confusion noise and hence the higher horizon redshift loss. This effect can be mitigated by 
the ET null stream available in the A configuration. 


could mitigate the contamination from confusion noise and thereby improving the detection 
sensitivity and accuracy. 


Mitigation of transient detector glitches: A number of known instrumental noise arte- 
facts such as glitches invalidate the standard assumption that the data is described by a 
superposition of the Gaussian instrumental noise and GW signals. If unmodeled, they will 
appear as false positives, whereas modeling glitches introduces additional uncertainties. Un- 
like GWs, these artefacts will appear as non-Gaussian outliers in the null stream, which 
provides an opportunity to remove them. Ref. [146] outlines two methods for eliminating 
glitches with the null stream. First, one can identify glitches by match-filtering them in 
the null stream. Second, one can identify null stream data segments that are inconsistent 
with Gaussian noise. The authors show that it is possible to end up with a clean Gaussian 
background in the A configuration in the limit where ET components approach the same 
sensitivity. 

Impact on science extraction: 'The non-Gaussian part of the background causes extended 
tails in the noise background and therefore limits the FAR. The exact extent to which this 
happens will only be known when ET is operational. Current gravitational wave observatories 
not only discover more GWs while achieving a better sensitivity but also retract more spurious 
signals [512]. This increasing trend might continue down to the design sensitivity of the ET. 
'The null stream will reduce the uncertainty associated with modeling glitches, and allow us 
to mitigate the effects of non-Gaussian noise. The most prominent benefit of the null stream 
is expected for high-mass distant BBHs that mimic glitches as well as for signals that are 
only described by phenomenological models. 


Control of known and unknown systematic errors: Any errors in the detector cali- 
bration can propagate into the null stream to cause incomplete cancellations of gravitational 
wave amplitudes [513]. If the signal waveform is a-priori well understood, and if its pa- 
rameters are well-measured by the network of detectors, then the residual signal in the null 
stream will be the product (in the frequency domain) of the calibration error and known 
weighted amounts of signal [514]. One can detect this residual by performing matched fil- 
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tering on the null stream. The calibration error can be obtained by fitting with a family of 
specific functions supplemented by the SNR output of the matched filters over a number of 
detected events. The ET null stream in the triangle configuration reduces the uncertainty 
in calibration parameters, which may propagate to GW signal parameters. However, the 
level of systematic errors in detector calibration in ET is not known, and even for current 
detectors it is only known to be less than 2% [515]. Therefore, it is not clear how many 
signals in ET will be affected by these systematic errors. Further studies are also necessary 
to demonstrate the performance of ET null stream calibration compared to other techniques, 
as well as the impact on the gravitational wave parameter estimation. In the worst case 
scenario, in 2L detectors calibration uncertainties will have to be estimated simultaneously 
with gravitational-wave source parameters, which would increase the computational cost of 
the analyses and introduce additional uncertainty. Note also that Ref. [514] estimates that, 
in a network of at least three L-shaped 3G detectors, the calibration error can be inferred at 
the percent level if supplemented with O(100) relatively loud (SNR=20) events. 


8 Summary 


In this work we have performed a detailed study of the Science Case of ET, significantly 
expanding the study in ref. [15]. We have considered both ET in its reference design (a 
10 km triangle in a xylophone configuration, with an HF instrument and a cryogenic LF 
instrument), as well as variations on the reference design, both in the geometry (triangle vs 
2L, with different arm lengths) and in the ASD (full sensitivity vs HF instrument only). 


First of all, the significantly more detailed study of the Science Case performed here confirms 
and provides more detail and evidence for the picture summarized in ref. [15]: ET, im its 
reference configuration, is a superb detector, with an extraordinary discovery potential in the 
domains of astrophysics, cosmology and fundamental physics. With order 104 — 10° BBH, 
BNS and NSBH detections per year, it will address and provide answers to an extremely rich 
and varied set of scientific questions; at the same time, it will penetrate deeply into unknown 
territories, where revolutionary discoveries could await for us. 


Starting from the extraordinary potential of the reference design, it is, however, still manda- 
tory to study what happens under variations of this baseline. At the very least, this allows us 
to understand how the different elements of the design, such as its geometry or the relative 
roles of the HF and LF instruments, concur to produce the scientific output of ET, and what 
could be reached in intermediate stages of the commissioning of the detector. Furthermore, 
this study can suggest directions where changes of the baseline design could results in im- 
provements of the science output. The original structure of the ET design was first laid 
down 10-15 years ago [10-12], well before the first detections of GWs. At that time, one 
was not even sure that there was, out there, a population of BBHs that coalesced within a 
Hubble time. Now we have a statistically significant catalog of detections, and building on 
the understanding that has accumulated thanks to these discoveries and the studies that they 
have stimulated, we are now able to present in this paper a study based on astrophysically- 


motivated populations.*° 


46T ast but not least, the setting up and the development of the ET Observational Science Board allowed the 
formation of a coherent scientific community focused on ET, with the necessary expertise across the rather 
broad range of subjects needed for such a study. 
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8.1 Comparison of different geometries 


In this paper we have examined several different geometries, comparing two L-shaped detec- 
tors on widely separated sites to a single-site triangle, and with different choices of arm-length 
(and relative orientations for the two L-shaped detectors), to provide a broad set of options. 
We will begin by summarizing the main results from the comparison of the 15 km 2L config- 
uration with the 10 km triangle, in Section 8.1.1. In Section 8.1.2 we will instead summarize 
the results of the comparison between the 15 km 2L and the 15 km triangle. In Section 8.1.3 
we also summarize the results for a single third-generation L-shaped detector. Further el- 
ements of the comparison between geometries, including correlated noise, the null stream, 
etc., are summarized in Section 8.1.4. 


8.1.1 Comparison between 15 km 2L and 10 km triangle 


Focusing first on the comparison of the 15 km 2L with the 10 km triangle, the main results 
can be summarized as follows. 


e Binary Black Holes. The results of Section 3.1 show that, for BBHs, the 2L con- 
figuration with 15 km arms, oriented at 45?, is superior to a 10 km triangle for the 
estimation of all parameters, and especially luminosity distance, see Fig. 4 in the main 
text and Tables 39-42 in App. C. To put things into perspective, for all parameters 
(except luminosity distance) the differences are at the level of factors of 2-3, while both 
configurations outperform LVKI-O5 by orders of magnitudes on all parameters (with 
less large, but still significant differences for angular localization), see Figs. 7 and 8. 
For instance, while all ET configurations detect basically the whole BBH population 
with SNR > 12, if we restrict to ‘golden events’ with SNR > 100 the 2L-15km-45? 
configuration detects (in our sample realization) 4933 BBH/yr, while the 10 km trian- 
gle 2298, and LVKI-O5 only 4. For SNR > 200 these figures becomes 644, 282 and 2, 
respectively; see Table 39 in App. C. For most parameters, as mentioned above, the 
differences between the 15 km 2L at 45? and the 10 km triangle are at the level of 
factors 2-3, in favor of the 2L; however, they are significantly larger for the accuracy 
on luminosity distance: in the 2L-15km-45? configuration there are 202 BBH/yr with 
error on dr, smaller than 1%, to be compared with 28 for the 10 km triangle (and, 
in our sample realization, just 1 for LVKI-O5). For angular localization, focusing on 
the events with AOgoy, < 50 deg? , we find 10304 BBH/yr for 2L-15km-45°, 6064 for 
the 10 km triangle and 1607 for LVKI-O5, while for AQg 9%, < 10deg? these numbers 
become 2124, 914 and 599, respectively; see Table 40.4” The corresponding difference 
in the joint accuracies on dr, and angular resolution shows that, also for this metric, 
the 2L-15km-45? is clearly superior to the 10 km triangle, see Fig. 5. 


In contrast, the 2L configurations with parallel arms are disfavored, because of their 
worse angular resolution; e.g., for 2L-15km-0° there are 3030 BBH/yr with AOggy, < 
50 deg?), to be compared with 10304 BBH/yr for 2L-15km-45°; for ANg% < 10 deg? 
the difference is even larger, with 374 BBH/yr for 2L-15km-0? (so, almost a factor of 
two less than LVKI-O5), compared to 2124 for 2L-15km-45°. 


^'Here the difference with LVKI-O5 becomes smaller: the events with very accurate angular localization 
are in general the closest, so they are accessible also to LVKI-O5, which can have good localization thanks to 
its network of five widely separated detectors. 
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e Binary Neutron Stars. Similar conclusions follow from the study of parameter es- 
timation for BNSs in Section 3.2, see Fig. 10 and Tables 42-44. Again the 2L with 
15 km arms at 45? improves on the already remarkable performances of the 10 km 
triangle, typically by factors of order 2-3, while both outperform LVKI-O5. For in- 
stance, if we restrict to ‘golden events’ with SNR > 50, the 2L-15km-45? configuration 
detects 1052 events per year, while the 10 km triangle 458 and LVKI-O5 only 3 (for 
SNR > 100 these figures becomes 134, 57 and 0, respectively). If we consider, instead, 
events with error on luminosity distance smaller than 10%, there is almost a factor of 
10 between 2L-15km-45? and the 10 km triangle, with 479 events/yr for 2L-15km-45° 
and 52 for the 10 km triangle (and only 1 for LVKI-O5; as always, in particular for 
small numbers, these are the numbers for our specific sample realization). For angular 
localization, focusing on the events with AQgoy < 100 deg”, we find 559 events/yr for 
2L-15km-45°, 184 for the 10 km triangle and 51 for LVKI-O5, see Table 43 (however, 
for a handful of very close events, localizable to better than 10 deg?, the five-detector 
network LVKI-O5 does better, and we get 25, 8 and 31 events, respectively). The left 
panel of Fig. 11 shows that, also for BNS, the 2L-15km-45° is clearly superior to the 
10 km triangle for the joint accuracy on dr, and angular resolution, that determine the 
localization volume. 


'The 2L configurations with parallel arms has again an angular localization capability 
worse than that of the 2L configuration at 45?, but the effect is less strong compared to 
BBHs (see the panel for AQgoy, in Fig. 10).4° On the other hand, the accuracy on the 
inclination angle ų for the parallel configurations is, comparatively, quite poor. Since 4 
is degenerate with the luminosity distance dr, also for dr, the parallel 2L configuration 
has significantly lower performance compared to the configuration at 45?. For instance, 
for the 2L-15km-0? we get 48 BNS/yr with error on dr, better than 1096, a factor of 
10 lower than the 479 BNS/yr for 2L-15km-45?. As for angular localization, for 2L- 
15km-0° we get 293 BNS/yr localized better than 100 deg?, so slightly better than the 
10 km triangle, but about a factor of 2 worse than the 559 BNS/yr for 2L-15km-45°; 
see again Table 43. 


For the parameter A that encodes the tidal deformability of the neutron stars, we see 
from Fig. 10 and Table 44 that, again, the 2L with 15 km arms improves by a factor 
~ 2 the already remarkable performances of the 10 km triangle, and both outperform 
LVKI-O5 by orders of magnitudes. For instance, considering the events with an error 
on A smaller than 10%, we get 2463 BNS/yr for 2L-15km-45°, 1040 for the 10-km 
triangle, and 2 for LVKI-O5. For events with an error on A smaller than 5% these 
numbers become 200, 96 and 0, respectively. 


e ET in a network with CE. When ET is inserted in a network with Cosmic Explorer 
(either a single CE 40 km detector or two 2 CE detectors, one with 40 km and one with 
20 km arms) the performance of the whole ET+CE network is, naturally, somewhat less 
sensitive to the choice of geometry for ET. However, the differences are still significant, 
particularly for the ET+1CE network. For instance, for ET+1CE, the number of BBHs 
with SNR » 200 is 1.8 x 10? when ET is in the 10 km triangle configuration, and raises 
to 2.4 x 10? for 2L-15km-45°, while the number of BBHs localized better than 10 deg? 
raises from 3.0 x 104 to 3.6 x 10^, and the number of BBHs with luminosity distance 


^5'lhis is due to the long duration of BNS signals, which allow using the rotation of the Earth to localize 
the signal, also for the parallel setting. 
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measured better than 1% raises from 2.9 x 10? to 4.3 x 10? (or from 5.3 x 10? to 7.0 x 10?, 
for ET+2CE); see Fig. 17 and Tables 39, 40. 


For BNSs, at ET+1CE, the number of detections/yr with SNR > 100 is 312 when ET 
is in the 10 km triangle configuration, and raises to 418 for 2L-15km-45?; the number of 
BNS/yr localized to better than 10 deg? raises from 2.4 x 10? to 3.8 x 10?; the number 
of BNS/yr with luminosity distance measured better than 196 raises from 4.1 x 10? to 
7.9 x 10? (or from 1.4 x 104 to 1.9 x 10+, for ET+2CE); and the number of BNS/yr 
with tidal deformability measured better than 5% raises from 243 to 400 (or from 337 
to 535 for ET--2CE); see Fig. 18 and Tables 42-44. 


In general, we see that the choice of geometry still has significant consequences also 
when ET is in a 3G network with one or two CE. 


Multi-messenger astrophysics. For multi-messenger astrophysics, the results of 
Section 4 show that, again, the 2L-15km-45? allows us to obtain a further improvement 
on the already remarkable performances of the 10 km triangle (and is comparable 
to the 15 km triangle), enabling the observation of a larger number of well-localized 
events, up to a larger redshift. The number of short GRBs with an associated GW 
signal increases by about 3096 making possible a joint detection at larger redshifts, 
and the number of expected kilonovae counterparts increases by a factor of 2. The 
better volumetric localization of 2L-15km-45? with respect to 10 km triangle, as shown 
in Fig. 5, facilitates the search of the EM counterpart giving the possibility to use 
galaxy-targeting strategy and enables a more efficient removal of contaminants. For 
pre-merger alerts with sky-localization smaller than 100 deg?, the 15 km triangle is 
performing better than the 10 km triangle and the 15 km 2L configuration, reaching 
almost the capability of the 2L configuration with 20 km arms, see Table 3. 


Stochastic backgrounds. For stochastic backgrounds, the best configurations are 
either 2L with parallel arms or the triangles, with the 2L parallel being the best below 
about 100 Hz and the triangle above 100 Hz. The 2L with arms at 45° is worse than the 
2L with parallel arms“? below about 200 — 300 Hz, while the two become comparable 
above 200 — 300 Hz (where, however, both are less good than the triangle); see Fig. 26. 
However, to exploit the better sensitivity (below 200-300 Hz) of 2L-parallel with respect 
to 2L-45? for studies of cosmology, the subtraction of the astrophysical background is 
necessary, see Section 6.5.3. An imperfect subtraction of CBC sources would pose a 
serious threat to reaching the nominal sensitivity to a cosmological background (see 
[420, 421, 516] for recent discussions), possibly even wiping out any advantage of the 
2L-parallel with respect to the 2L-45° configuration. 


The ability to resolve the angular distribution of stochastic backgrounds, through their 
lowest multipoles, is significantly better for the 2L configurations (both aligned and 
misaligned) compared to the triangle, see Fig. 29. 


The analysis in Section 6 shows that (as could be expected) the performance of the 


various geometries, with respect to the specific science cases studied, broadly follows the 
indications obtained from the more general metrics studied in Sections 3-5. The 10 km 
triangle is a game-changer with respect to LVKI-O5, allowing a jump of orders of magnitudes, 


^9See Section 2 for the definition that we use for the relative angle between the L-shaped detectors, which 
differs by about 2.51? from the more standard one using the great circle connecting the two detectors. 
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accessing physics that it is well beyond reach for 2G detectors. The 15 km 2L with arms at 
45? provides a further gain with respect to the 10 km triangle, typically by factors of order 
2-3, depending on the specific scientific question. Some examples, from the various studies 
presented in this paper, are as follows. 


e Concerning tests of GR based on BH quasinormal modes, we see from Table 13 that the 
ringdown SNR of GW150914 would have had a remarkable value of 141 in the 10 km 
triangle, which would further raise to 192 in 2L-15km-45°, inducing a corresponding 
difference in the reconstruction of ringdown frequencies and damping times, according 
to eq. (6.1). Furthermore, we see from Table 14 that the 10 km triangle would detect 
41 events/yr with a ringdown SNR higher than 50, that raise to 110 events/yr for the 
2L-15km-45° (for a ringdown SNR higher than 100, these numbers become 4 and 10, 
respectively), resulting in a further difference in performance when stacking different 
events. 


e For nuclear physics studies, we see from Table 17 that the 10 km triangle can reach a 
remarkable accuracy of 10.0 m on the radius of neutron stars (taken the same for all 
NS, just as a proxy for the actual hyper-parameters determining the NS equation of 
state); the 2L-15km-45° further brings it down to 6.4 m, thanks to a more accurate 
reconstruction of tidal deformability for individual events, and the stacking of a larger 
number of events. For the postmerger SNR, instead, we see from Fig. 44 that the 
differences between different geometries do not exceed the (10 — 20)% level, with the 
hierarchy fixed by the arm length. 


e For the reconstruction of the merger rate, Fig. 45 and Table 24 show that both the 
2L-15km-45? configurations and the 10-km triangle lead to a correct reconstruction 
of the merger rate, although the 2L-15km-45° configuration leads to a more accurate 
reconstruction, by an average factor ~ 3 in the central values. 


e For primordial BHs, assuming the PBH population model mentioned in the text, the 
10 km triangle would detect 77 events/yr at z > 30 (a smoking-gun signature for a 
primordial origin), which is already quite remarkable. The 2L-15km-45° configuration 
would further rise this by a factor ~ 3, with 238 events/yr, resulting in a limit on the 
PBH abundance stronger by a factor about 1.8; see Table 25. 


In cosmology, we see from Tables 28-34 that, for the tests that we have performed, the 
results for the accuracy on Ho, on the dark energy equation of state and on modified 
GW propagation from the various geometries analyzed are quite similar. In general, 
the 2L-15km-45? can improve the results of the 10 km triangle on the accuracy of Ho, 
wo or Ep by factors ranging between ~ (1.3 — 2.5), depending on the test." 


'To sum up, the first overall conclusion that emerges is that: 


9" Here we restricted to the 10 and 15 km triangles and the 15 and 20 km 2L at 45°. 

?Targer differences, in favor of the 15km-2L-45? with respect to the 10km triangle (and, to some extent, 
also with respect to the 15km triangle), are expected when extracting cosmological information by correlating 
dark sirens with galaxy catalogs, because of the much better volume localization of the 2L-15km-45° config- 
uration, see Fig. 5 for BBHs, and Fig. 11 for BNSs. A quantitative assessment, however, also depends on the 
completeness of the galaxy catalogs that will be available when ET will be in operation. Work on this is in 
progress. 
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1. All the triangular and 2L geometries that we have investigated can be the baseline 
for a superb 3G detector, that will allow us to improve by orders of magnitudes 
compared to 2G detectors, and allow us to penetrate deeply into unknown territories. 


How far we will penetrate into such territories, however, depends to some extent on the 
configuration chosen. From the above discussion, it follows that 


2a. The 2L-15km-45° configuration in general offers better scientific return with 
respect to the 10 km triangle, improving on most figures of merits and scientific 
cases, by factors typically of order 2-3 on the errors of the relevant parameters. 


8.1.2 Comparison between 15 km 2L and 15 km triangle 


We next summarize the main results of the comparison between the 15 km triangle and 
the 2L-15km-45? configuration, so in this case we change the geometry while keeping the 
arm-length fixed. The main results of this comparison are as follows: 


e For parameter estimation of BBHs, the 2L-15km-45° is clearly superior to the 15 km 
triangle for the estimate of luminosity distance (with 202 events/yr with dr, measured 
better than 196, against 77 for the 15 km triangle), and quite similar for all other 
parameters, see Fig. 4 and the first row of Fig. 9, and Tables 40 and 41 in App. C. 
Correspondingly, also the number of sources with good overall volume localization (say, 
error less than 196 on dr, and angular resolution better than 50 deg?), is clearly better 
for the 2L-15km-45°, see the right panel in Fig. 5. 


e The same holds for parameter estimation of BNSs, see Fig. 10 and the first row of 
Fig. 15, and Tables 43 and 44 in App. C. In particular, the 2L-15km-45? has 479 
events with dr measured better than 1096, to be compared with 153 for the 15 km 
triangle (which become 4328 to be compared with 1756, requiring an error smaller 
than 3096); see Table 43. Instead, for angular resolution, the 15 km 2L with arms at 
45? and the 15 km triangle are quite comparable (e.g., for the BNS localized better 
than 100 deg?, with a threshold SNR > 12 we find 559 events/yr for 2L-15km-45° and 
479 for the 15km triangle, see Table 43, which become 644 and 764, respectively, with 
SNR > 8, see Table 1. At this level, the comparison is of course affected also by 
sample variance). The respective joint distribution of events with respect to distance 
and angular localization is shown in the right panel of Fig. 11, which again shows an 
overall preference for 2L-15km-45?. For all other parameters the performances are quite 
similar. 


e This reflects also on several aspects of the science case, where we see that the 2L- 
15km-45? and the 15 km triangle have similar performances; this can be seen, e.g., in 
Tables 1, 6 and 10 for multi-messenger astrophysics (except for the pre-merger alerts for 
which the 15 km triangle results are better, as shown in Table 3); Tables 13 and 14 for 
physics near the BH horizon; Fig. 39 for tidal deformability of exotic compact objects; 
Table 17 for the measurement of the NS radii; Fig. 44 for the post-merger signal of 
BNS coalescences; Table 24 for the merger rate reconstruction (where the 2L-15km-45? 
is a factor of 2-3 better than the 15 km triangle); Table 25 and Fig. 47 for primordial 
BHs; or Tables 28-36 for cosmology. 
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The overall conclusion, in this comparison, is that: 


2b. The 2L-15km-45° configuration and the 15 km triangle have very similar perfor- 
mances on all parameters both for BBHs and BNSs, except for luminosity distance, 
where the 2L-15km-45° configuration is better by a factor ~ 3 in the number events 
with accurately measured distance. 


8.1.3 A single L-shaped detector 


Finally, we have considered the performance of a single L-shaped detector with 20 km arms, 
still with the ET characteristics in terms of ASD. The conclusion is that such a detector, 
taken as a single detector not inserted into a network with other 3G detectors, is not capable 
of delivering the science expected from the next generation of detectors. Its capability of 
angular localization of the source, and of reconstruction of the luminosity distance, would 
even be very much inferior to what the LVK network is expected to reach by the end of the 
O5 run, see the corresponding panels in Figs. 6 and 12 (actually, the number of well-localized 
sources detected per year would not even really improve on the results already obtained 
with the current GWTC-3 catalog of detections). This would result in a complete loss of 
all the science case aspects of 3G detectors related to multi-messenger astrophysics and to 
cosmology. Stochastic backgrounds of GWs, that in the frequency band of ground-based 
detectors can only be detected by correlating the output of two or more detectors, would also 
not be accessible to a single 3G detector, giving up again the corresponding aspects of the 
science case related to early Universe cosmology and to astrophysics. Furthermore, with a 
single detector, unmodeled burst signals and short signals due to massive BBH coalescences 
would be hard to distinguish from instrumental glitches, rendering problematic any confident 
detection of transient sources. 

Therefore, a single L-shaped detector does not have a valid science case, even with 20 km 
arms (or longer). If inserted in an international network with 2CE, such as detector could 
be a useful addition to the network. However, given the scale of the investment, ET must 
have a solid science case, independently of the decisions that will be taken by other funding 
agencies, unrelated to a European project. Therefore: 


3. A single L-shaped detector is not a viable alternative, independently of arm length. 
If a single-site solution should be preferred for ET, the detector must necessarily have 
the triangular geometry. 


8.1.4 Further aspects of the triangle-2L comparison 


In this section we summarize our discussions of further aspects that are relevant to the 
comparison between the triangle and the 2L configurations. 


e Correlated noise. For the triangle vs. 2L comparison an important aspect, that needs 
further studies, is that the triangle configurations suffer from a potential threat from 
correlated Newtonian, seismic and magnetic noise, or lightening strikes; while significant 
uncertainties exist in their estimate, these correlated noise might spoil the sensitivity to 
stochastic backgrounds below some frequency and, possibly, even to unmodeled bursts. 
A network of two widely spaced L-shaped detectors is significantly less sensitive to 
these problems. 
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More in detail, an important potential danger, for all triangle configurations, is due to 
the fact that some mirrors of different nested interferometers are apart by a few hundred 
meters and, on this scale, correlated seismic and Newtonian noise can be important. 
This has been studied in detail only recently [138]. According to that analysis, such 
correlated noise would swamp the sensitivity to stochastic backgrounds of ET, in its 
10 km triangle configuration, up to frequencies of order (50 — 100) Hz (blue curve in 
Fig. 33), to the extent that the actual sensitivity of ET to stochastic backgrounds, 
up to these frequencies, would not improve on LIGO’s A+ and Virgo's AdV- design. 
Different assumptions could result in a lower level of noise, also shown in Fig. 33. With 
the most optimistic assumptions, the search for a stochastic background would still be 
impacted up to ~ 10 Hz (pink curve in Fig. 33). Such problems arise from correlations 
on the scale of hundreds of meters, and therefore do not concern a 2L configuration 
with widely spaced detectors. 


Magnetic noise will also pose difficulties to the triangle configuration: in the case of a 
very pessimistic assumption, where the magnetic noise in ET is the same as in the Virgo 
central building and it is fully correlated between two ET interferometers, it will affect 
the ASD sensitivity over the entire frequency band, by several orders of magnitudes at 
low frequencies, f S O(30) Hz, and by a factor 1-10 at higher frequencies, see the lower 
panel in Fig. 34. Again, this would only apply for the co-located interferometers in the 
triangular configuration. However, further research is needed to understand to which 
level these infrastructural noise sources would be correlated. 


Compared to the triangular design, the 2L configuration is also less prone to effects com- 
ing from coherent lightning strikes, which could affect not only stochastic background 
searches, but, potentially, also the search for unmodeled bursts. 


The null stream. The triangle has the advantage of having a null stream, where the 
GW signal cancels. This will certainly be beneficial. However, while the mathematics 
of the null stream is elegant, when one is confronted to its use in the concrete setting of 
the experiment several issues arise, that make it difficult to quantify reliably its impact. 


First of all, the triangle null stream is only operative when the interferometers in all 
three arms are on, so it is quite sensitive to the duty cycle. In this work we have 
assumed an uncorrelated 85% duty cycle in each L-shaped detector, and in each of the 
three instruments composing the triangle, see Section 3; in this case the null stream 
will be available for (0.85)? = 61% of the run time. Assuming instead an independent 
duty cycle of 80% for each arm (the best duty cycle for the O3b LVK run is 79% 
[517]), implies that the null stream will only be available for 51% of the run time. 
However, the triangle design is considerably more complex than that of 2G detectors, 
with two instruments in each arm (an HF instrument, and a LF instrument working at 
cryogenic temperatures), and it is difficult to reliably estimate what the duty cycle will 
be. In a (hopefully very pessimistic) scenario where each of the six instruments has an 
independent duty cycle of 8096, the null stream would only be available for 2696 of the 
run time. 


An advantage of the null stream is that it provides a direct estimate of the instrument- 
noise PSD, see eq. (7.3). Here, however, enters the assumption that noise in different 
arms are uncorrelated. Noise produced identically in two interferometers (as could be 
the case for the natural magnetic background or seismic gravity fluctuations) would 
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not appear in the null stream, and would therefore bias the PSD estimates that use the 
null stream (unless these noise can be identified and characterized, e.g. with witness 
sensors). 


Assuming to have obtained an unbiased estimate of the PSD, the most direct impact 
would be on the measurement of the stochastic gravitational-wave background. An 
unbiased estimate of the PSD would also be beneficial for compact binary coalescences, 
to avoid a reduction of the detection horizon from the confusion noise of the unresolved 
BNS (that, being long-lasting, can produce a continuous backgrounds, contrary to 
BBHs that rather produce a *popcorn noise, see Section 5). However, here the advantage 
of the null stream should be weighted against the possibility of making longer arms. 
Fig. 68 shows that, assuming a value of the local rate Ro of BNS mergers equal to 
the median of the currently allowed range, the reduction in the horizon range for the 
2L detectors, due to this confusion noise, would be between 296 and 7.596 for CBCs 
with masses below 20M, and negligible above. With extreme assumptions on the rate 
Ro, taken at the upper limit of the currently allowed range, the reduction would be 
between 5% and 20% for CBCs with masses below 20M, and again negligible above. 
The triangle null stream would be able to mitigate this reduction, possibly eliminating 
it completely, which is certainly very beneficial. On the other hand, we see from 
Fig. 3, left panel, that, in the same range of masses, the 2L 15km at 45? configuration 
has an horizon distance larger than the 10 km triangle by a factor ~ (1.4 — 1.8), i.e. 
(140 — 180)%. 


The confusion noise also impacts the calculation of the false alarm rate. The null stream 
would allow estimating the noise background distribution directly (e.g. time shifting 
the null stream with respect to itself). In this way one could obtain more accurate 
estimates of the false alarm rate. 


'The null stream can be very effective at subtracting glitches, that would appear as 
non-Gaussian outliers in the null stream. The most prominent benefit in this case is 
expected for high-mass distant BBHs, that mimic glitches, as well as for signals that 
are only described by phenomenological models. In a 2L configuration, one would have 
to rely only on coincidences between detectors, and environmental sensing. 


Another potential virtue of the null stream is that is reduces the uncertainty in the 
calibration (although it only provides relative calibration errors). However, the level 
of systematic errors in detector calibration in ET is not known. As discussed in Sec- 
tion 7, further studies are needed to demonstrate the performance of the ET null stream 
calibration, compared to other calibration techniques. In the worst case scenario, in 
2L detectors calibration uncertainties will have to be estimated simultaneously with 
gravitational-wave source parameters, which would increase the computational cost of 
the analyses and introduce additional uncertainty. 


In summary, while in the triangle configuration the null stream would certainly be 
beneficial, its concrete application is subject to several uncertainties that are diffi- 
cult/impossible to model before building and commissioning the detector, such as the 
duty cycle, the level of correlated or identical noise in different arms, the level of non- 
gaussianity in the noise or, as for instance in the case of detector calibration, the state 
of advancement of other alternative techniques. 


e Site optimization for 2L. Finally, it should be observed that, in this work, no effort 
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was made to optimize the distance between the two L-shaped detectors, even just within 
Europe, and we simply used the locations of the two current candidate sites. A longer 
baseline would improve in particular the localization accuracy, as well as the estimation 
of the parameters more strongly correlated with angular localization.?? 


8.2 The role of the low-frequency sensitivity 


We now draw our conclusions from the comparison between the two different choices of the 
amplitude spectral density (ASD) that we have studied, namely the full xylophone design, 
consisting of a high-frequency instrument together with a low-frequency instrument working 
at cryogenic temperature (‘HFLF-cryo’), and the situation in which there is only the high- 
frequency instrument (‘HF-only’). The main messages that emerge from our analysis are the 
following: 


4. The low-frequency sensitivity is crucial for exploiting the full scientific potential 
of ET. In the HF-only configuration, independently of the geometry chosen, several 
crucial scientific targets of the science case would be lost or significantly diminished. 


For instance: 


e For all geometries, in the HF-only configuration the number of BNS sources localized 
to better than 10? deg? (which are the sources relevant for multi-messenger studies) 
degrades catastrophically; with the threshold SNR > 8 that we use for multi-messenger 
studies, we see from Tables 1 and 2 that, for the 10 km triangle, the number of detections 
with Ago, < 10? deg? (and all orbit inclinations) decreases from 280 to 14 BNS/yr, 
ie. by a factor of 20 (and from 26 to 7 for events with viewing angle less than 15°), 
while for the 2L 15km-45? it decreases from 644 to 76 (for all orbit inclinations), or 
from 68 to 35 (for viewing angle less than 15?). Note that the reduction is more severe 
for the 10 km triangle than for the 2L 15km-45? configuration. Increasing the SNR 
threshold the effect is even stronger: for SNR > 12, the number of events localized to 
better than 10? deg? decreases from 184 to 4 BNS/yr for the 10 km triangle while, for 
the 2L 15km-45°, it decreases from 559 to 11; see Table 43. 


Indeed, the decrease is so severe that the number of BNS/yr localized to better than 
10? deg? would even become much lower than what will be obtained already by LVKI 
during the O5 run, see the panel on AQgo% in Figs. 13 and 14, and Table 43: as we 
mentioned, the number of BNS/yr with SNR > 12 and AQggy, < 10? deg? would be 11 
for the 2L 15km-45? HF-only, and 4 for the 10 km triangle HF-only; by comparison, 
for LVKI-O5 we find 51 BNS/yr with SNR > 12 and AQ)gos; < 10? deg?. 


The accuracy on the measurement of the luminosity distance also degrades dramatically. 
For instance, for the 2L with 15 km arms at 45°, the number of BNS/yr with dr, 
measured to better than 10% degrades from 479 in the HFLF-cryo configuration, to 12 


52A third possible candidate site, depending on the outcome of the geological studies, could emerge near 
Kamenz, in the Lusatia region, Saxony (Germany). The great circle chord distance between the Sardinia site 
and the site in the Meuse-Rhine is 1165.0 km, while that between the Sardinia site and Kamenz is 1247.3 
km, so the two distances are quite comparable, with the Sardinia-Kamenz distance larger by about 7%. In 
contrast, the great circle chord distance between Kamenz and the site in Meuse-Rhine is 575.4 km, significantly 
smaller. 
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in the HF configuration, while for the 10 km triangle it degrades from 52 to 1; for 2L- 
15km-45°, the number of events/yr with dr, measured to better than 5% degrades from 
O(100) in the HFLF-cryo configuration, to about 1. The same trend is observed for BNS 
‘golden events’, see Fig. 15. The accuracy on many other parameters would also degrade 
significantly, see again Figs. 13 and 14. These results are due to the fact that, with the 
low-frequency sensitivity ensured by the LF instrument, BNSs stay in the bandwidth 
for a long time, with a corresponding crucial benefit for parameter estimation and, 
particularly, localization and the partial disentangling of the polarizations, that allows 
us to alleviate the degeneracy between dz and cos i. 


For the same reason, for BNS, pre-merger alerts for localized events would become 
impossible without the low-frequency instrument, compare Table 3 with Table 4. For 
instance, for the 10 km triangle the total number of BNS/yr detected 30 min before 
merger would go from 905 (of which 10 localized, 30 min before merger, to better 
than 10? deg? and 85 better than 10? deg?) to zero; for the 2L 15km-45° it would go 
from 2172 (of which 20 localized, 30 min before merger, to better than 10? deg? and 
194 better than 10? deg?) again to zero. As an example, an event such as GW170817 
would enter the E'T bandwidth about 1 day before the merger with the full HFLF-cryo 
sensitivity, but only O(10) minutes before the merger with an ASD such that it entered 
the detector bandwidth only at 10 Hz. 


This would have a dramatic impact on the possibility to detect precursors, and to 
probe the prompt/early counterparts, which provide rich information on the physics 
acting in the GRB engine and jet launch and kilonova ejecta. Of particular interest are 
prompt/early detection in the very high-energy gamma rays and ultraviolet. 


For the multi-messenger studies, losing the low frequencies reduces the detection of 
short GRBs with associated gravitational-wave signals; the number of joint detection 
decreases by about 4096 for the 10 km triangle, by about 3096 for the 15 km triangle 
and 2L with 15 km arms at 45?, and 2096 for the 2L with 20 km arms at 45?, see 
Tables 6 and 7. The reduction is more severe for kilonova counterparts: the triangle 
HF configurations detect less than 696 of the kilonovae detected by triangle HFLF- 
cryo, while the 2L HF-only configurations detect less than 1596 of the KNe detected 
by the 2L HFLF-cryo configuration, see Tables 10 and 11. Note that, once again, 
the triangle geometry is more heavily affected than the 2L by the loss of the low 
frequency instrument. The reduction of short GRBs with an associated GW signal 
might impact our understanding of the physics of the GRB engine and relativistic jets, 
by requiring more years of joint observations to achieve the same science goals. However 
for cosmology, nuclear physics, and studies of the heavy-elements nucleosynthesis which 
rely on kilonova detections, loosing the LF has a major impact on achieving the goals 
themselves. 


It is worth noting that, for short GRBs, the HF-only 2L-15km-45? and the HF-only 
15km triangle enable the detection of the GW signal for a number of GRBs comparable 
to the full HFLF-cryo triangle with 10 km arms. On the other hand, for the kilonovae 
counterparts, no HF-only configuration (even HF-only 2L with 20 km arms at 45?) is 
comparable to 10 km triangle HFLF-cryo. This is due to a different type of search: while 
to observe short GRBs, wide FoV satellites in survey mode are used and the temporal 
coincidence of the GW/GRB signals are used to identify a joint detection, to search 
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for kilonovae the telescopes FoV is typically much smaller than the GW localization 
uncertainty and thus requires selecting better-localized events to be followed up. The 
much-improved localization obtained by accessing low-frequencies then becomes a key 
parameter for the search. 


e The HF-only configuration has a significantly smaller reach in distance, compared to 
the HFLF-cryo configuration, in the whole mass range relevant to 3G detectors, as can 
be seen from the right panels in Figs. 2 and 3. 


'This has several significant consequences. In the range of masses relevant to BNS, 
for the 10 km triangle, the redshift to which a BNS with optimal orientation and sky 
location can be detected would decrease from z ~ 4 down to z ~ 2 (or, for the 2L- 
15km-45? , from z ~ 6 to z œ 3). This is particularly significant since the peak of the 
star formation rate is around z ~ 2 — 3, so studies of demography and population of 
BNS would be strongly impacted. 


The lowest reach in redshift would also make it impossible to identify primordial BHs 
on the basis of the criterion that BBHs at z2 30 cannot be of astrophysical origin. 
For instance, for the 10 km triangle, the maximum redshift at which a compact binary 
coalescence could be seen, for optimal values of the total mass and optimal orientation 
and sky location, reduces from z ~ 100 in the HFLF-cryo configuration to z ~ 20 in 
the HF configuration, see Fig. 2. 


Intermediate mass BHs, with masses in the range (10? — 10*) Mc, would also be visible 
to significantly smaller distances, see again Fig. 2, reducing significantly the chances of 
discovery. As an example, a BBH with total mass 5 x 10? Mọ would be visible up to 
z ~ 1.1 in the HFLF-cryo configurations (almost independently of geometry), but only 
up to z ~ 0.6 in the HF only configurations, corresponding to a reduction by a factor 
~ 5 in the comoving volume explored and in the corresponding chances of detection. 


e The measurement of the eccentricity of subsolar mass compact objects, an important 
criterion for assessing or excluding the primordial origin of a BBH, would degrade by 
more than one order of magnitude, see Fig. 50. 


e The degradation of the accuracy in luminosity distance and angular localization would 
basically render impossible all studies of cosmology, such as accurate measurements of 
A and especially studies of the dark energy equation of state and of modified GW prop- 
agation, which crucially rely on accuracy in sky localization and luminosity distance for 
events at large z. Similarly, the exquisite localization accuracy on some high-mass ratio 
events, needed to extract the Hubble parameter from them, see Section 6.4.3, would be 
impossible without the LF instrument. 


e For a number of other aspects of the science case the loss of the LF instrument would not 
be as disruptive, but would still results in a loss of accuracy on the relevant parameters, 
by factors of order 2-3. This is the case, for instance, for the possibility of distinguishing 
BHs from Exotic Compact Objects on the basis of the spin-induced quadrupole moment 
or of a non-vanishing tidal deformability (Figs. 38 and 39), or for the accuracy on the 
reconstruction of NS radii, see Table 17. 


On the other hand, not all aspects of the Science Case depend on the LF instrument, and 
another important conclusion that follows from our study is: 
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5. There are some important targets of the Science Case that depend only on the 
HF sensitivity, and that could be fully reached with an HF-only instrument. 


In particular: 


For multi-messenger astronomy, the number of joint detections of GWs from a BNS 
merger and the associated X-ray afterglow, as could be detected by an instrument 
such as THESEUS in survey mode, is basically independent of the LF instrument; see 
Table 8. 


Tests of physics near the BH horizon based on the ringdown signal of the final BH 
are completely independent of the LF instrument, see Table 13 and the discussion in 
Section 6.1.1. The same holds for the search of echoes and near-horizon structures, see 
Section 6.1.2. 


'The post-merger signal of BNS coalescences, that contains the information of the NS 
equation of state, is concentrated in the high-frequency region, and is completely in- 
sensitive to the presence of the LF instrument, see the left panel in Fig. 44. 


The detectability of sub-solar mass primordial BHs is another example of a metric that 
is not affected by the loss of the LF instrument; see Fig. 49. 


Stochastic backgrounds of cosmological origin are in general expected to have a smooth 
power-like behavior, Qew(f) « f^, over the bandwidth of ground based or space-borne 
detectors (with the exception of backgrounds from cosmological phase transitions, that 
will be broadly peaked around a characteristic frequency). At the very low frequencies, 
say f ~ 10-15 Hz, tested by CMB, there are extremely stringent limits OQew S 10715 
(see, e.g. Chapter 22 of [85]). Therefore, at the frequencies of ground based detectors, 
a stochastic background, relic of the early Universe, could only be detectable if it grows 
with frequency, Qew(f) x f% with a > 0, at least until a frequency where it saturates, 
followed by a cutoff. For this reason, the stochastic background predicted by standard 
single-field slow-roll inflation is not detectable, since it is basically flat (in fact, slightly 
red, ie. oa < 0). However, there are alternative cosmological models that predict 
spectra with o 0, such as the pre-big-bang model [518, 519] or axion inflation (see 
[520], or Fig. 12 of [15]). In these cases, a significant part of the SNR could accumulate 
in the high-frequency part of the detector bandwidth, where the LF instrument does not 
contribute to the sensitivity to stochastic backgrounds, see Fig. 28. A similar behavior 
takes place, in the ET bandwidth, in the model for cosmic strings whose predictions 
are shown in Fig. 63. 


A final important conclusion that emerges from our study is that: 


6. For some important aspects of the Science Case, the 2L with 15 km arms at 
45°, already in the HF-only configuration, is comparable the 10 km triangle in a full 
HFLF-cryo configuration. 


In particular: 
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e For parameter estimation of BBHs, the 2L with 15 km arms at 45? in the HF-only 
configuration is comparable to the 10 km triangle at full HFLF-cryo sensitivity, with 
better performance of luminosity distance, less good performance on mass reconstruc- 
tion, and equivalent performances on all other parameters and in SNR distribution, see 
Fig. 8. The performance of the 2L-15km-45? configuration is also equivalent to that of 
the 10 km triangle for what concerns ‘golden BBH events’, see the lower row of Fig. 9. 


e From the right panel of Fig. 40 we see that, for the measurement of the neutron star 
radii and the consequences for nuclear physics that we can derive from it, the 2L with 
15 km arms at 45? already in the HF-only configuration has performances very similar 
to that of the full 10 km triangle HFLF-cryo. 


e For all the items discussed above, where the LF instrument does not contribute (joint 
GW--X-ray afterglow and, to some extent, GRB detections, tests of physics near the 
BH horizon, post-merger signal of BNS coalescences, sub-solar mass BHs, stochastic 
backgrounds growing as f* with a > 0), the 2L-15km-45? in the HF only configuration 
will be superior to the 10 km triangle with the full HFLF-cryo sensitivity. 


8.3 Conclusions 


'The decisions on the ideal design of a detector such as the Einstein Telescope, which is 
meant to have a leading role in the field of gravitational waves for decades, are part of a 
complex process. In this study we have considered alternatives to the geometry (a single-site 
triangle against two widely separated L-shaped interferometers, with different options for 
the arm-lengths) and to other aspects of the design (considering the baseline design given by 
a full xylophone configuration with a high frequency instrument together with a cryogenic 
low-frequency instrument, and comparing it to a high-frequency only instrument). 

The first conclusion of this work, which extends the study presented in [15], is that, 
whatever geometry is decided between the triangle and 2L configurations considered, ET will 
clearly be a game-changer. It will over-perform the network of 2G observatories at their best 
expected O5 sensitivities by several orders of magnitudes in all relevant metrics, accessing 
physics that it is well beyond reach for 2G detectors. Note, however, that this is only true 
for the triangle and for the 2L configurations. A single L-shaped detector, not inserted in a 
global 3G network, even with a very long arm length, does not have a viable Science Case, 
at the level expected from a 3G detector. 

The comparison between the 10 km triangle and a 15 km 2L shows that, from the 
scientific point of view, the 15 km 2L with arms at 45? is superior on basically all the 
metrics that we have considered, with the exception of the nominal sensitivity to stochastic 
backgrounds. However, in this case we have seen that the triangle suffers from a potentially 
very significant treat from correlated noise, particularly at low frequencies. For compact 
binary coalescences, the better performances of the 15 km 2L with arms at 45? allow us to 
gain typically further factors of order 2-3 on the number of events which pass given cuts on 
SNR, or on the accuracy of reconstruction of various parameters and physical quantities. In 
fact, we have found that the 15 km 2L with arms at 45? is quite comparable to the 15 km 
triangle: the two have very similar performances for the reconstruction of all parameters of 


55Turthermore, more work is needed to asses the effect of an imperfect subtraction of the astrophysical 
backgrounds, which could limit the sensitivity to cosmological backgrounds to a level fixed in all cases by the 
accuracy of the subtraction. 
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compact binary coalescences, except for luminosity distance, for which the 15 km 2L with 
arms at 45? is significantly better. 

Another interesting element that emerged from our study is that the configuration 
2L 15km at 45° gives a better possibility of proceeding in steps, compared to the 10 km 
triangle. Both for the triangle and 2L configurations, the full scientific outcome expected 
from a 3G detector will only be reached in the full configuration with the HF and the 
cryogenic LF instrument. However, the 2L 15km at 45°, already in the HF-only configuration, 
would be sufficient to obtain some important results (which is true only to a lesser extent 
for the 10 km triangle). It is therefore natural to investigate whether a staging procedure 
will be advantageous, where the HF interferometer, which is the simpler to commission, is 
put into operation first, until the full HFLF-cryo configuration is reached, possibly going 
through intermediate steps such as, for instance, an LF instrument working, at first, at room 
temperatures. These issues, however, are beyond the scope of this study, and should be 
examined within the context of the ET Instrument Science Board. 

Finally, the triangle configuration is prone to the effect of correlated Newtonian, seismic 
and magnetic noise on the scale of hundreds of meters. While more work is needed to fully 
understand them, these effects might pose a risk/challenge to obtaining the target sensitivity 
of the triangle, particularly in the low-frequency region. 
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A Basic formalism for stochastic backgrounds 


In this appendix we review several basic concepts relevant for the characterization of stochas- 
tic GW backgrounds. At the frequencies of ground-based detectors, to obtain an interesting 
sensitivity one needs to cross-correlate the output of at least two detectors in a network, see 
e.g. [83, 521] for reviews: as a consequence the observable quantity is not a strain, but rather 
quantities quadratic in the strain. One usually introduces a polarisation tensor defined (in 
frequency space) as 

Pah hes (A.1) 


with a,b = +, x and we defined the superposition of signals in a given direction n and at a 
given frequency f as 


hij (a, f) = hy (a, fe (A) + hx ( 


>> 


fej (8) . (4.2) 


In full generality the polarisation tensor, being a complex 2 x 2 tensor, can be decomposed 
in a basis of the identity and Pauli matrices as 


Puli, f) = 5 [168. Plas + Ua, No) HVA o e Qs Dot] A3) 
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where the coefficients 7, U, V, Q are Stokes parameters describing intensity and polarisation, 
respectively. Polarisation is expected to be very small if we consider a homogeneous and 
isotropic AGWB and a CGWB: the amount of polarisation generated via classical diffusion 
is very small for both background components, see [105]. However, there is a non-negligible 
amount of circular polarization in the AGWB, generated by Poisson fluctuations in the num- 
ber of unresolved sources, which can be detected by third-generation interferometers with 
SNR» 1 (see [522]). Measuring a highly polarised background component would therefore be 
interesting both for shedding light on some cosmological parity-violating generation mecha- 
nisms (see e.g. [523, 524]) or to estimate whether the SGWB comes from a handful of sources 
or a relatively large population of binaries. Here we focus on the study of intensity and of 
its spectrum. It is common in the literature to introduce a dimensionless quantity related to 
intensity via 


An? f? " 
GI (i. f) 
1 dpgs(R, f) 


~ p. din f dn ' PM 


Ogy (fi, f) 


where the second equality clarifies the physical meaning of this observable: it corresponds to 
the background energy density, pgw(n, f), per unit of logarithmic frequency and unit solid 
angle (normalised to the critical density of the Universe today, pe), that reaches the observed 

from a direction n. 
'To characterize the detector sensitivity to stochastic backgrounds, we use the Power- 
Law Integrated sensitivity curve (PLS), introduced for the first time in [142] for a network 
of L-shaped detectors, and we extend it to the case of triangular detectors, following [116]. 
When dealing with triangular detectors (three nested detectors), one has to keep in mind 
that the noise between two detectors with an arm in common is not uncorrelated, hence 
the correlation has to be accounted for when computing the PLS. Here we assume that the 
noise in ET is 20% correlated between detectors with an arm in common. As we discuss in 
Section 5.4, see in particular Fig. 33, this is a value intermediate between the extremes of 
modeled noise correlations; we chose here a frequency-independent correlation for simplicity. 
Let us assume to have a network of N detectors. The antenna pattern for a detector 

pair AB is given by 
Aas(a, f) = yApe "rfe ban, (A.5) 


where bap is the baseline vector connecting the center of mass of two detectors A and B in 
a network, while the overlap reduction function is defined as 

yas = = [Trahet)Irahet)* + Take”) TH(ahe*)'] . (A.6) 
in terms of the polarisation tensors et’* and the detector response tensor a4. The latter, for a 
Michelson interferometer with arms pointing along the directions u4 and v4, has components 
a given by p _ v 

ay = (uw, — v4v4)/2. (A.7) 

Notice that the factor 5/(87) in eq. (A.6) is a conventional factor introduced so that the 
overlap functions integrate to unity over the sphere for two co-located and perfectly aligned 
detectors with orthogonal arms. 


— 146 - 


We define the average antenna pattern function, often referred to in the literature as 
the average overlap function as 


oan) = f a B a (A.8) 
We start with the effective noise power spectral density. In full generality, this is given by 
=1/2 
Seg = | 35 (Ng Psc(f) UN ) palf) ; (A.9) 
ABCD 


where we are taking the trace of the product of four matrices. Notice that for the case of 
uncorrelated detectors this expression reduces to 


E ÓAB 
(N; Dues NA ; (A.10) 


where NV > is the noise power spectral density (PSD). In this case Eq. (A.9) can be simplified 
to: 


NN -1/2 

JAB 

Sep = x 4 ] (A.11) 
Aca Bs ENT 

However, for triangular detectors, we need to know both the noise variance (auto-correlation) 

N H, and the cross-detector covariance N f We convert Eq. (A.9) to energy density units 

using 


= 3Hj alf) . 


Sea(f) m Ag? f3 (A.12) 
For a set of power-law indices f, we write 
Qaw(f) = Qa (f/ fret)? (A.13) 


where fref is a reference frequency such that, at f = fier, Qaw(f) has the value Qg, and we 
compute the value of the amplitude Qg such that the integrated signal-to-noise ratio p has 
some fixed value. This is given by 


pL pine ad is 
Qg = Var n QE) df. (A.14) 


In the following we will set p = 1. For each pair of values (6,08), we compute New = 
Q3(f/ fret)? The envelope is the power-law integrated sensitivity curve. Formally it is given 
by 


B 
Oprs(f) = maxg le (i) | : (A.15) 


Any line (on a log-log plot) that is tangent to the power-law integrated sensitivity curve corre- 
sponds to a gravitational-wave background power-law spectrum with an integrated signal-to- 
noise ratio p = 1. This implies that, if the curve for a predicted background lies everywhere 
below the sensitivity curve, then p « 1 for such a background. Note that, from the defini- 
tion (A.13) of Qg and fref, a change in the choice of fref is compensated by a corresponding 
change of Qg, such that Qg Pr remains constant. Therefore, the value of Qg obtained from 
eq. (A.14) depends on the arbitrary choice of fref, but Oprs(f), given by eq. (A.15), does 
not. 
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Figure 69: Overlap reduction function (ORF) as a function of the angle o. 


B Sensitivity to stochastic backgrounds of misaligned 2L configurations 


Focusing on the stochastic gravitational-wave background, we highlight below the effect of 
the orientation of the two L-shaped interferometers on the peak sensitivity. As discussed 
in Section 2, in this paper we have fixed the arms of the Sardinia interferometer so that 
they point locally East and North, and rotate anti-clockwise the arms of the Netherlands 
interferometer by an angle a with respect to the local East-North directions. The angle 
a = 0 differs from the usual definition of the alignment of two interferometers with respect to 
the great circle [21, 22] by 2.51 degrees and is clearly not the maximum possible alignment, 
due to the curvature of the Earth between the two sites. 

In Fig. 69 we show the overlap reduction function (ORF) as a function of the angle 
a, for the 0 deg and 45 deg cases, which are the examples considered in the main text. 
However, the best and worst cases for a stochastic search are the values a = 2.51 deg and 
o = 47.51 deg, respectively. Clearly, while varying the angle o near the 0 deg case has very 
little impact, the consequences of varying o just a few degrees around the 45 deg are very 
important. These results reflects into a significant dependence on a of the peak sensitivity 
to the stochastic gravitational-wave background. 

Figure 70 illustrates the peak sensitivity to the stochastic gravitational-wave background 
(minimum value of the power-law integrated curve) as a function of the angle o. The choices 
œ = 0 deg and 45 deg, considered in the main text of this document, are not respectively 
the best and worst cases for stochastic searches. Varying the angle o near the 0 deg case has 
very little impact, but varying a just a few degrees around the 45 deg case varies the peak 
sensitivity to the stochastic gravitational-wave background by orders of magnitude. As one 
can clearly see, the best and worst cases for stochastic searches, are when a = 2.51 deg and 
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Figure 70: Peak sensitivity to the stochastic gravitational-wave background as a function of the 
angle a. 


47.51 deg, respectively. 

To be strictly optimal the alignment of two interferometers would have the two detectors 
such that each had an arm along the great circle connecting them for 0 deg (which is how the 
two LIGO detectors are aligned), and rotating one by 45 deg for the minimal value [22]. For 
such a minimal alignment the overlap reduction function would be exactly zero in the limit 
[Az — 0, where Az is the separation between the two detectors, and very small basically for 
all frequencies, as we see from Fig. 69. This would also give the best polarization separation 
for the CBC searches. 


C Tables of figures of merit for BBHs and BNSs 


To make the comparison among different geometries and ASDs easier, we provide here tables 
with the number of sources detected by each configuration with various cuts on the match- 
filtered SNR, or on measurement errors on some of the most relevant quantities, corresponding 
to the results shown in Figs. 4, 7, 8 and 9 for BBHs, and to Figs. 10, 13, 14 and 15 for BNS. 
We further show the same quantities for the networks ET+1CE and ET+2CE, corresponding 
to the results shown in Figs. 17 and 18. 

In particular, in Table 39 we show the number of detected BBH sources with different 
cuts in SNR, ranging from 8 to 200; in Table 40 with cuts on the relative error on the lumi- 
nosity distance and the 90% sky localisation [notice that the parameter estimation analysis 
has been performed only for the events detected with SNR > 12]; and in Table 41 with cuts 
on the relative error on the detector-frame chirp mass and the error on the spin magnitude 
of the primary object. 
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In Table 42 we report the number of BNS sources detected with different cuts in SNR, in 


Table 43 with cuts on the luminosity distance relative error and 90% sky localisation area, 


54 


and in Table 44 with cuts on the relative errors on the chirp mass and adimensional tidal 


deformability parameter A [defined in Eq. 3.1]. 


Configuration SNR > 8 | SNR > 12 | SNR > 50 | SNR > 100 | SNR > 200 
A-10km-HFLF-Cryo 103 528 87 568 13674 2298 282 
A-15km-HFLF-Cryo 111231 101 308 26 092 5730 759 
2L-15km-45°-HFLF-Cryo 107 661 97 205 23 491 4933 644 
2L-20km-45°-HFLF-Cryo 110 698 103 773 34009 8828 1267 
2L-15km-0°-HFLF-Cryo 104 935 94015 24 088 5143 642 
2L-20km-0°-HFLF-Cryo 106 417 98 274 32915 8551 1246 
A-10km-HF 87 125 65 092 5595 773 98 
A-15km-HF 102 149 85 698 13 697 2360 292 
2L-15km-45°-HF 97 881 81 210 12089 1987 248 
2L-20km-45°-HF 105 032 93 050 20551 4144 515 
2L-15km-0?-HF 89 707 73 696 10688 1732 201 
2L-20km-0°-HF 104 558 92 308 21970 4540 569 
A-10km-HFLF-Cryo+CE-40km 115179 110118 44 676 12590 1805 
2L-15km-45°-HFLF-Cryo+CE-40km | 116328 112661 50 947 15 545 2355 
2L-15km-0?-HFLF-Cryo--CE-40km || 114816 | 110265 49 034 14820 2243 
A-10km-HFLF-Cryo+2CE 117045 113910 52 092 16 109 2505 
2L-15km-45°-HFLF-Cryo+2CE 117 436 115 166 57 678 19 028 3126 
2L-15km-0°-HFLF-Cryo+2CE 116 639 113 597 55 218 17 849 2917 
LVKI-O5 8603 2861 4T 4 2 


Table 39: Summary table of the number of BBH detections with various cuts in SNR for the 
considered ET geometries and ASDs, also in combination with one 40 km CE detector or two CE 
detectors and for a LVKI network during O5. 


9^Some differences with the results shown in Tables 1 and 2 are due to the different choices for the cut 
in SNR for the parameter estimation analysis; in Section 3 (to which the result of this appendix refer) was 
used SNR > 12, while in Section 4 the threshold was lowered to SNR > 8 because of the increased statistical 
significance provided by to the coincidence with an electromagnetic counterpart. As can be seen from Table 42, 
passing from one cut to the other, the number of detections changes by more than factor of 2, in particular for 
the HF-only configurations. Part of the discrepancy is also due to the different inversion procedure adopted in 
the two sections, as already outlined in footnote 9, with some events being discarded in the results presented 
here and in Section 3, while their Fisher matrices are regularized in the ones reported in Section 4, resulting 


in higher numbers. 
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Configuration Ad; /dr € 0.1 | Adr/dr € 0.01 | ARo% € 50 deg? | ANgox € 10 deg? 
A-10km-HFLF-Cryo 10 969 28 6064 914 
A-15km-HFLF-Cryo 17321 77 10 470 2273 
2L-15km-45°-HFLF-Cryo 22 237 202 10304 2124 
2L-20km-45?-HFLF-Cryo 28 801 365 14920 3648 
2L-15km-0?-HFLF-Cryo 13 865 79 3030 374 
2L-20km-0°-HFLF-Cryo 17 008 144 4706 608 
A-10km-HF 3919 6 2409 281 
A-15km-HF 8083 26 5156 817 
2L-15km-45°-HF 11193 56 5263 835 
2L-20km-45°-HF 16 155 113 8448 1566 
2L-15km-0°-HF 4111 17 1054 120 
2L-20km-0°-HF 9693 57 2936 362 
A-10km-HFLF-Cryo+CE-40km 80 676 2901 69 268 29 924 
2L-15km-45°-HFLF-Cryo+CE-40km 82 358 4301 73 164 36 457 
2L-15km-0°-HFLF-Cryo+CE-40km 64471 2995 57 497 25 182 
A-10km-HFLF-Cryo+2CE 101912 5250 90 889 46 744 
2L-15km-45°-HFLF-Cryo+2CE 104 289 7006 95 387 54640 
2L-15km-0°-HFLF-Cryo+2CE 93 832 5824 85 267 45 429 
LVKI-O5 767 1 1607 599 
Table 40: Same as Table 39, for BBH sources, with cuts on the relative error on the luminosity 
distance and on the 90% sky localisation area. 
Configuration AM./Me € 1073 | AMe/ Me < 107 | Axı < 0.05 | Axı € 0.01 
A-10km-HFLF-Cryo 48 922 4549 27 877 2811 
A-15km-HFLF-Cryo 64 469 7703 41 612 4856 
2L-15km-45°-HFLF-Cryo 58 371 6456 35 943 3958 
2L-20km-45?-HFLF-Cryo 67999 9073 45 666 5706 
2L-15km-0°-HFLF-Cryo 57 330 6472 33 236 3653 
2L-20km-0°-HFLF-Cryo 63 154 8279 40 068 4935 
A-10km-HF 21146 1580 11715 1438 
A-15km-HF 32 643 2818 19 956 2564 
2L-15km-45°-HF 28 442 2405 16 382 2025 
2L-20km-45°-HF 36 969 3547 23 205 2940 
2L-15km-0°-HF 25 863 2146 13 669 1652 
2L-20km-0°-HF 38 537 3740 23 065 2935 
A-10km-HFLF-Cryo+CE-40km 73 785 8901 54 789 6649 
2L-15km-45°-HFLF-Cryo+CE-40km 79 425 11187 60 709 7988 
2L-15km-0°-HFLF-Cryo+CE-40km 77772 10 885 57471 7519 
A-10km-HFLF-Cryo+2CE 79 895 10 523 62 826 8226 
2L-15km-45°-HFLF-Cryo+2CE 84 507 12 703 68 382 9557 
2L-15km-0°-HFLF-Cryo+2CE 82473 12 307 65 287 9012 
LVKI-O5 78 1 155 20 


Table 41: Same as Table 39, for BBH sources, with cuts on the relative error on the detector-frame 
chirp mass and on the error on the spin magnitude of the primary object. 
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Configuration SNR > 8 | SNR > 12 | SNR > 50 | SNR > 100 | SNR > 150 
A-10km-HFLF-Cryo 107 902 36 985 458 57 19 
A-15km-HFLF-Cryo 213 583 89910 1206 159 38 
2L-15km-45?-HFLF-Cryo 190 528 77 458 1052 134 33 
2L-20km-45°-HFLF-Cryo 275 595 129 821 2018 243 64 
2L-15km-0°-HFLF-Cryo 192 030 78 675 1040 136 33 
2L-20km-0°-HFLF-Cryo 274 395 132 486 2048 250 65 
A-10km-HF 44 113 13410 166 18 9 
A-15km-HF 116 349 41 181 516 55 17 
2L-15km-45?-HF 101 550 34956 447 52 15 
2L-20km-45°-HF 176 396 70 441 961 115 32 
2L-15km-0°-HF 103 539 35817 443 57 17 
2L-20km-0°-HF 184 799 74 805 989 124 37 
A-10km-HFLF-Cryo+CE-40km 348 434 177 925 2836 312 87 
2L-15km-45°-HFLF-Cryo+CE-40km 392 680 212 260 3677 418 116 
2L-15km-0°-HFLF-Cryo+CE-40km 402 234 220 023 3770 414 119 
A-10km-HFLF-Cryo+2CE 406 630 220 725 3961 436 120 
2L-15km-45°-HFLF-Cryo+2CE 442 526 252 136 4900 559 152 
2L-15km-0°-HFLF-Cryo+2CE 448 798 258 615 4974 531 162 
LVKI-O5 250 71 3 0 0 


Table 42: Same as Table 39 for BNS sources. 


Configuration Adz /dr € 0.3 | Adr/dr € 0.1 | AQgoy < 100 deg? | AQgox, € 10 deg? 
A-10km-HFLF-Cryo 748 52 184 8 
A-15km-HFLF-Cryo 1756 153 479 23 
2L-15km-45°-HFLF-Cryo 4328 479 559 25 
2L-20km-45°-HFLF-Cryo 7821 919 1028 43 
2L-15km-0°-HFLF-Cryo 774 48 293 12 
2L-20km-0°-HFLF-Cryo 1499 104 565 23 
A-10km-HF 4 1 4 0 
A-15km-HF 7 1 11 1 
2L-15km-45?-HF 126 12 11 0 
2L-20km-45?-HF 262 22 24 1 
2L-15km-0°-HF 20 1 11 1 
2L-20km-0°-HF 28 2 24 1 
A-10km-HFLF-Cryo+CE-40km 32 053 4100 54 994 2427 
2L-15km-45°-HFLF-Cryo+CE-40km 45 252 7949 75 828 3838 
2L-15km-0°-HFLF-Cryo+CE-40km 16 999 2079 29 821 1515 
A-10km-HFLF-Cryo+2CE 72335 13630 112 705 6570 
2L-15km-45°-HFLF-Cryo+2CE 89 877 19 129 145 272 9841 
2L-15km-0°-HFLF-Cryo+2CE 78 798 14909 125 640 7592 
LVKI-O5 12 1 51 31 


Table 43: Same as Table 40 for BNS sources. 


— 152 — 


Configuration AMe/Me x 107? | AMe/Me < 1074 | AMA < 0.1 | AA/A x 0.05 
A-10km-HFLF-Cryo 35197 18401 1040 96 
A-15km-HFLF-Cryo 84835 39 607 2783 227 
2L-15km-45°-HFLF-Cryo 68 391 36 645 2463 200 
2L-20km-45°-HFLF-Cryo 115 695 59051 5189 386 
2L-15km-0°-HFLF-Cryo 67 023 29 813 2225 179 
2L-20km-0°-HFLF-Cryo 113726 49 445 4703 374 
A-10km-HF 10831 687 248 21 
A-15km-HF 31663 1548 667 45 
2L-15km-45°-HF 25 058 2033 634 47 
2L-20km-45°-HF 50 464 3644 1494 90 
2L-15km-0°-HF 22 839 1463 445 36 
2L-20km-0?-HF 42 731 2292 780 72 
A-10km-HFLF-Cryo+CE-40km 171997 46 008 2669 243 
2L-15km-45°-HFLF-Cryo+CE-40km 197631 66 214 4753 400 
2L-15km-0°-HFLF-Cryo+CE-40km 198171 63 219 4488 371 
A-10km-HFLF-Cryo+2CE 216 875 58 522 3847 337 
2L-15km-45°-HFLF-Cryo+2CE 244 198 80 406 6319 535 
2L-15km-0°-HFLF-Cryo+2CE 250033 79 746 6106 505 
LVKI-O5 54 2 2 0 


Table 44: Same as Table 41, for BNS sources, with cuts on the relative error on the detector-frame 
chirp mass and on the relative error on the tidal deformability parameter A. 
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D Correlation between parameters for typical events 


In this appendix we discuss the correlation between parameters, in the BBH parameter 
estimation. This is instructive to obtain a physical understanding of some of the results 
presented in the main text. In particular, we wish to understand why the 15 km triangle and 
the 15 km 2L at 45? have performances very similar for the reconstruction of all parameters, 
except for luminosity distance, where the 15 km 2L at 45? is clearly superior, see Fig. 4 and 
Tables 40, 41. To this purpose, we consider in detail some selected events (the correlation 
patterns that we will discuss are in fact generic). We select some BBH event with very high 
SNR, so that the Fisher matrix analysis is expected to be more reliable. As a first example we 
consider, in our ensemble of detected events, a ‘light’ BBH event (that we denote as “event 
1”) with source-frame masses m; ~ 7.9Mo, m2 = 7.6Mo, at a distance dr, ~ 0.30 Gpc 
(z ~ 0.065). Due to its close distance, its SNR is very high, and is 513 in the 15 km triangle 
and 478 in the 15 km 2L at 45°. 

In Fig. 71 we show the correlation between some parameters. For readability, within 
the 15-dimensional parameter space that we use for BBHs, we restrict to the most interesting 
correlations, and we compare the results for the 15 km 2L at 45? with the 15 km triangle, 
both in their full HFLF-cryo configuration. The interesting point is that, in the correlation 
of various parameters with the luminosity distance, the contours for the 15 km triangle (red) 
are in general more tilted than the blue contour referring to the 15 km 2L at 45?. In Fig. 71 
we see this in the correlation with all the parameters shown. This implies that, when one 
marginalizes over these parameters, the same error on these parameters induces a larger 
marginalized error on dr. We see indeed that, for all parameters except dr, for this event 
the marginalized errors are the same for the 15 km triangle and the 15 km 2L at 45?, while 
the marginalized distribution for dr, is narrower for the 15 km 2L at 45?. Observe that this 
happens despite the fact that, for this event, the SNR in the triangle was higher than in the 
15 km 2L at 45°. 

Fig. 72 shows the analogous results for an event (“event 2”) with source-frame masses 
mı ~ 29.0Mo, m» = 24.6Mo, at a distance dr, œ 1.19 Gpc (z ~ 0.23). Its SNR is again 
very high, 424 in the 15 km triangle and 318 in the 15 km 2L at 45?. As we reminded in 
Section 2, the differences in the SNR between configurations, alone, is not a good indicator 
of the relative performances of parameter estimation (the most obvious example being that, 
on an ensemble of events, a 2L parallel configuration has higher SNR but worse angular 
localization than a 2L at 45°). This is also clearly visible from this plot: in this case, despite 
the larger SNR in the 15 km triangle, most parameters are significantly better estimated by 
the 15 km 2L at 45?. We also see that, again, the correlation contours of dr, with various 
parameters are more tilted in the triangle case, contributing to enlarging the marginalized 
posterior of dr. This effect is also particularly evident, for this event, in the correlation 
between 0 and 4. 
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Figure 71: A corner plot showing the correlations between various parameters for a BBH event 
with light masses and very close distance (*event 1", see the text). 
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Figure 72: As in Fig. 71 for a BBH event with heavier masses and greater distance (“event 2”, see 
the text). 
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Acronyms 


3G Third Generation. 2 


AGWB Astrophysical Gravitational Wave Background. 51 


ASD Amplitude Spectral Density. 6 


BAO Barionic Acoustic Oscillation. 98 
BBH Binary Black Hole. 1 


BNS Binary Neutron Star. 1 


CBC Compact Binary Coalescence. 5 

CDO Compact Dark Object. 126 

CE Cosmic Explorer. 2 

CGWB Cosmological Gravitational Wave Background. 51 
CIB Cosmic Infrared Background. 52 

CMB Cosmic Microwave Background. 52 

CSD Cross-power Spectral Density. 61 


CW Continuous Wave. 117 


DE Dark Energy. 95 


DM Dark Matter. 87 


ECO Exotic Compact Object. 72 
EM Electromagnetic. 39 
EoS Equation of State. 39 


ET Einstein Telescope. 2 


FAR False Alarm Rate. 128 
FOPT First-Order Phase Transition. 114 
FoV Field of View. 24 


FRW Friedmann-Robertson-Walker. 100 


GR General Relativity. 15 


GRB Gamma Ray Burst. 3 
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GW Gravitational Wave. 1 


GW'TC Gravitational Wave Transient Catalog. 88 


HF High-Frequency. 2 


HMNS Hypermassive Neutron Star. 83 


IMBH Intermediate Mass Black Hole. 14 
IMR Inspiral Merger Ringdown. 107 
ISB Instrument Science Board. 4 


ISCO Innermost Stable Circular Orbit. 74 
KN Kilonova. 39 


LF Low-Frequency. 2 

LSST Large Synoptic Survey Telescope. 95 

LVC LIGO-Virgo Collaboration. 1 

LVKI LIGO Hanford, LIGO Livingston, Virgo, KAGRA, LIGO India. 5 


NN Newtonian Noise. 60 
NR Numerical Relativity. 83 


NSBH Neutron Star-Black Hole. 1 


ORF Overlap Reduction Function. 148 


OSB Observational Science Board. 3 


PBH Primordial Black Hole. 87 
PLS Power Law Sensitivity. 3 
PN Post Newtonian. 10 


PSD Power Spectral Density. 61 
QNM Quasinormal Mode. 68 


SFH Star Formation History. 58 
SGWB Stochastic Gravitational Wave Background. 51 
SMNS Supramassive Neutron Star. 83 


SN Supernova. 98 
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SNR Signal-to-Noise Ratio. 7 


SXI Soft X-ray Imager. 47 


tCW Transient Continuous Wave. 117 


TOV Tolman—Oppenheimer—Volkoff. 79 
UV Ultra Violet. 42 


VRO Vera Rubin Observatory. 48 


XGIS X/Gamma-ray Imaging Spectrometer. 44 
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